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PURIFIED THERMOSTABLE PYROCOCCUS FURTOSUS 
DNA POLYMERASE I 

Technical Field 
5 The present invention relates to a thermostable 

enzyme having DNA polymerase I activity useful in 
nucleic acid synthesis by primer extension reaction. 

Background 

10 The archaebacteria are a recently discovered 

group of microorganisms that grow optimally at 
temperatures above 80 °C. Some 20 species of these 
extremely thermophilic bacteria-like organisms have 
been isolated, mainly from shallow submarine and deep 

15 sea geothermal environments. Most of the 

archaebacteria are strict anaerobes and depend on the 
reduction of elemental sulfur for growth. 

The archaebacteria include a group of 
"hyperthermophiles" that grow optimally around 100 °C. 

20 These are presently represented by three distinct 
genera, Pyrodictium, Pyrococcus, and Pyrobaculum. 
Phyodictium brocKU (T^ 105 «C) is an obligate 
autotroph which obtains energy be reducing S° to HgS 
with H2, while Pyrobaculum islandicum (T^ 100°C) is a 

25 faculative heterotroph that uses either organic 
substrates or H 2 to reduce S°. In contrast, 
Pyrococciis furiosus (T^ 100 °C) grows by a 
fermentative-type metabolism rather than by S° 
respiration. It is a strict heterotroph that utilizes 

30 both simple and complex carbohydrates where only H 2 
and C0 2 are the detectable products. The organism 
reduces elemental sulfur to H 2 S apparently as a form 
of detoxification since H 2 inhibits growth. 

The discovery of microorganisms growing optimally 

35 around 100 °C has generated considerable interest in 



both academic and industrial communities. Both the 
organisms and their enzymes have the potential to 
bridge the gap between biochemical catalysis and many 
industrial chemical conversions. However, knowledge 
of the metabolism of the hyperthermophilic 
microorganisms is presently very limited. 

The polymerase chain reaction (PCR) is a powerful 
method for the rapid and exponential amplification of 
target nucleic acid sequences. PCR has facilitated 
the development of gene characterization and molecular 
cloning technologies including the direct sequencing 
of PGR amplified DNA, the determination of allelic 
variation, and the detection of infectious and genetic 
disease disorders. PCR is performed by repeated 
cycles of heat denaturation of a DNA template 
containing the target sequence, annealing of opposing 
primers to the complementary DNA strands, and 
extension of the annealed primers with a DNA 
polymerase. Multiple PCR cycles result in the 
exponential amplification of the nucleotide sequence 
delineated by the flanking amplification primers. 

An important modification of the original PCR 
technique was the substitution of Thermus acruaticus 
(Taq) DNA polymerase in place of the Klenow fragment 
of E. coli DNA pol I (Saiki, et al. Science. 230:1350- 
1354 (1988)). The incorporation of a thermostable DNA 
polymerase into the PCR protocol obviates the need for 
repeated enzyme additions and permits elevated 
annealing and primer extension temperatures which 
enhance the specificity of primer: template 
associations. Taq polymerase thus serves to increase 
the specificity and simplicity of PCR. 

Although Taq polymerase is used in the vast 
majority of PCR performed today, it has a fundamental 
drawback: purified Taq DNA polymerase enzyme is 
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devoid of 3 1 to 5' exonuclease activity and thus 
cannot excise misinserted nucleotides (Tindall, et 
al., Biochemistry , 29:5226-5231 (1990)). Several 
independent studies suggest that 3' to 5' 
5 exonuclease-dependent proofreading enhances the 

fidelity of DNA synthesis. Reyland et al, J. Biol. 
Chem. , 263:6518-6524, 1988; Kunkel et al, J. Biol. 
Chem. . 261:13610-13616. 1986; Bemad et al, Cell . 
58:219-228, 1989. Consistent with these findings, the 

10 observed error rate (mutations per nucleotide per 

cycle) of Taq polymerase is relatively high; estimates 
range from 2 X 10' 4 during PCR (Saiki et al., Science . 
239:487-491 (1988); Keohavaong et al. Proc. Natl. 
Acad. Sci. USA . 86:9253-9257 (1989)) to 2 X 10~ 5 for 

15 base substitution errors produced during a single 
round of DNA synthesis of the lacZ gene (Eckert et 
al., Nucj. AcA4s ReSi# 18:3739-3744 (1990)). 

Polymerase induced mutations incurred during PCR 
increase arithmetically as a function of cycle number. 

20 For example, if an average of two mutations occur 

during one cycle of amplification, 20 mutations will 
occur after 10 cycles and 40 will occur after 20 
cycles. Each mutant and wild type template DNA 
molecule will be amplified exponentially during PCR 

25 and thus a large percentage of the resulting 
amplification products will contain mutations. 
Nutations introduced by Tag polymerase during DNA 
amplification have hindered PCR applications which 
require high fidelity DNA synthesis. 

30 

Summary of the Invention 

A thermostable DNA polymerase from the 
hyperthermophilic, marine archaebacterium, Pyrococcus 
furiosus (Pfu) has been discovered. The monomer ic, 
35 multifunctional enzyme possesses both DNA polymerase 
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and 3' to 5 1 exonuclease activities. The polymerase 
is extremely thermostable with a temperature optimum 
near 75 °C. The purified enzyme functions effectively 
in the polymerase chain reaction (PCR) . In addition, 
5 results from PGR fidelity studies indicate that 
Pyrococcus furiosus DNA polymerase yields 
amplification products containing 12 fold less 
mutations than reaction products from similar 
amplifications performed with Tag DNA polymerase. The 

10 3 1 to 5* exonuclease dependent proofreading activity 
of Pfu DNA polymerase will excise mismatched 3 1 
terminal nucleotides from primer: template complexes 
and correctly incorporate nucleotides complementary to 
the template strand. 

15 Unlike Taq DNA polymerase, Pfu DNA polymerase 

does not possess 5" to 3 r exonuclease activity. Pfu, 
like Taq and Vent polymerases, does exhibit a 
polymerase dependent 5 1 to 3' strand displacement 
activity* Pfu DNA polymerase remains greater that 95% 

20 active after one hour incubation at 95 °C In 

contrast. Vent polymerase [New England Biolabs (NEB) 
Beverly, MA] looses greater than 50% of its polymerase 
activity after one hour incubation at 95 °C Pfu DNA 
polymerase is thus unexpectedly superior to Taq and 

25 Vent DNA polymerases in amplification protocols 
requiring high fidelity DNA synthesis. 

Thus, the present invention contemplates a 
purified thermostable P. furiosus DNA polymerase I 
(Pfu DNA Pol I or Pyro polymerase) having an amino 

30 terminal amino acid residue sequence represented by 
the formula shown in SEQ ID NO 1, having 775 amino 
acid residues. 

The apparent molecular weight of the native 
protein is about 90,000-93,000 daltohs as determined 

35 by SDS-PAGE under non-denaturing (non-reducing) 



conditions using Tag polymerase as a standard having a 
molecular weight of 94,000 daltons. In preferred 
embodiments, the Pyro polymerase is isolated from P. 
f uriosus . and more preferably has a specific 3' to 5' 
exonuclease activity. 

Brief Description of the Drawings 

In the drawings forming a portion of this 
disclosure : 

Figure 1 illustrates the molecular weight 
determination of Pyro DNA polymerase compeared to Tag 
DNA polymerase by SDS-PAGE analysis (8-16% gradient 
gel, under non-reducing conditions). High molecular 
weight standards (Sigma Chem. Co., St. Louis, HO) are 
electrophoresed in lanes 1 and 6. The molecular 
weights of Phosphorylase B (97,200 daltons) and bovine 
serum albumin (66,000 daltons) in lane 6 are indicated 
by arrows. Sequencing grade Taq DNA polymerase which 
has been modified (Pr omega Biotech, Madison, WI) has 
an apparent molecular weight of 80,000 daltons (lane 
2) . Taq DNA polymerases from Cetus (Emeryville, CA) 
and Stratagene (La Jolla, CA) are electrophoresed in 
lanes 3 and 4, respectively, each with an apparent 
molecular weight of 94,000 daltons. In lane 5, Pyro 
DNA polymerase exhibits a molecular weight of 90,000 - 
93,000 daltons. 

Detailed Description of the Invention 
A. Definitions 

As used herein, "cell", "cell line", and "cell 
culture" can be used interchangeably and all such 
designations include progeny. Thus, the words 
"transformants" or "transformed cells" includes the 
primary subject cell and cultures derived therefrom 
without regard for the number of transfers. It is 
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also understood that all progeny may not be precisely 
identical in DNA content, due to deliberate or 
inadvertent mutations. Mutant progeny that have the 
same functionality as screened for in the originally 
5 transformed cell are included. 

The term "control sequences" refers to DNA 
sequences necessary for the expression of an operably 
linked coding sequence in a particular host organism. 
The control sequences that are suitable for 
10 procaryotes, for example, include a promoter, 

optionally an operator sequence, a ribosome binding 
site, and the like. Eucaryotic cells are known to 
utilize promoters, polyadenylation signals, and 
enhancers. 

15 The term "expression system" refers to DNA 

sequences containing a desired coding sequence and 
control sequences in operable linkage, so that hosts 
transformed with these sequences are capable of 
producing the encoded proteins. In order to effect 

20 transformation, the expression system may be included 
on a vector; however, the relevant DNA may then also 
be integrated into the host chromosome. 

The term "gene" as used herein refers to a DNA 
sequence that encodes a polypeptide. 

25 "Operably linked" refers to juxtaposition such 

that the normal function of the components can be 
performed. Thus, a coding sequence "operably linked" 
to control sequences refers to a configuration wherein 
the coding sequences can be expressed under the 

30 direction of the control sequences. 

The term "oligonucleotide" as used herein is 

c 

defined as a molecule comprised of two or more 
debxyribonucleotides and/or ribonucleotides, 
preferably more than three. Its exact size will 
35 depend on many factors, which in turn depend on the 
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ultimate function or use of the oligonucleotide. The 
oligonucleotide may be derived synthetically or by 
cloning. 

The term "primer" as used herein refers to an 
5 oligonucleotide, whether occurring naturally or 

produced synthetically, which is capable of acting as 
a point of initiation of nucleic acid synthesis when 
placed under conditions in which synthesis of a primer 
extension product which is complementary to a nucleic 
10 acid strand is induced, i.e., in the presence of four 
different nucleotide triphosphates and thermostable 
enzyme in an appropriate buffer (•buffer" includes pH, 
ionic strength, cof actors, etc.) and at a suitable 
temperature. For Pyro polymerase, the buffer herein 
15 preferably contains 1.5-2 mM of a magnesium salt, 

preferably MgCl 2 , 150-200 jiM of each nucleotide, and 1 
uM of each primer, along with preferably 50 mM KC1, 20 
mM Tris buffer, pH 8-8.4, and 100 /jg/ml gelatin. 
The primer is preferably single-stranded for 
20 maximum efficiency in amplification, but may 

alternatively be double-stranded. If double-stranded, 
the primer is first treated to separate its strands 
before being used to prepare extension products. 
Preferably, the primer is an oligodeoxyribonucleotide. 
25 The primer must be sufficiently long to prime the 

synthesis of extension products in the presence of the 
thermostable enzyme. The exact lengths of the primers 
will depend on many factors, including temperature, 
source of primer and use of the method. For example, 
30 depending on the complexity of the target sequence, 
the oligonucleotide primer typically contains 15-25 
nucleotides, although it may contain more or few 
nucleotides. Short primer molecules generally require 
colder temperatures to form sufficiently stable hybrid 
35 complexes with template. 



The primers herein are selected to be 
"substantially" complementary to the different strands 
of each specific sequence to be amplified. This means 
that the primers must be sufficiently complementary to 
hybridize with their respective strands. Therefore, 
the primer sequence need not reflect the exact 
sequence of the template. For example, a non- 
complementary nucleotide fragment may be attached to 
the 5 1 end of the primer, with the remainder of the 
primer sequence being complementary to the strand. 
Alternatively, non-complementary bases or longer 
sequences can be interspersed into the primer, 
provided that the primer sequence has sufficient 
complementarity with the sequence of the strand to be 
amplified to hybridize therewith and thereby f orm a 
template for synthesis of the extension product of the 
other primer. However, for detection purposes, 
particularly using labeled sequence-specific probes, 
the primers typically have exact complementarity to 
obtain the best results. 

As used herein, the term "thermostable enzyme" 
refers to an enzyme which is stable to heat and is 
heat resistant and catalyzes (facilitates) combination 
of the nucleotides in the proper manner to form the 
primer extension products that are complementary to 
each nucleic acid strand. Generally, the synthesis 
will be initiated at the 3« end of each primer and 
will proceed in the 5 1 direction along the template 
strand, until synthesis terminates, producing 
molecules of different lengths. 

The thermostable enzyme herein must satisfy a 
single criterion to be effective for the amplication 
reaction, i.e., the enzyme must not become 
irreversibly denatured (inactivated) When subjected to 
the elevated temperatures for the time necessary to 



WO 92/09689 



PCT/US91/09026 



effect denaturation of double-stranded nucleic acids. 
Irreversible denaturation for purposes herein refers 
to permanent and complete loss of enzymatic activity. 
The heating conditions necessary for denaturation will 
5 depend, e.g., on the buffer salt concentration and the 
length and nucleotide composition of the nucleic acids 
being denatured, but typically range from about 90 to 
about 96 °C for a time depending mainly on the 
temperature and the nucleic acid length, typically 

10 about 0*5 to four minutes. Higher temperatures may be 
tolerated as the buffer salt concentration and/or GC 
composition of the nucleic acid is increased. 
Preferably, the enzyme will not become irreversibly 
denatured at about 90-100 °C. 

15 The thermostable enzyme herein preferably has an 

optimum temperature at which it functions that is 
higher than about 40°C, which is the temperature below 
which hybridization of primer to template is promoted, 
although, depending on (1) magnesium and salt, 

20 concentrations and (2) composition and length of 

primer, hybridization can occur at higher temperature 
(e.g., 45-70°C) . The higher the temperature optimum 
for the enzyme, the greater the specificity and/or 
selectivity of the primer-directed extension process. 

25 However, enzymes that are active below 40 °C, e.g., at 
37 °C, are also with the scope of this invention 
provided they are heat-stable. Preferably, the 
optimum temperature ranges from about 50° to 90 °C, 
more preferably 60-8 0°c. 

30 Amino Acid Residue : The amino acid residues 

described herein are preferred to be in the "L" 
isomeric form. However, residues in the "D" isomeric 
form can be substituted for any L-amino acid residue, 
as long as the desired functional property is retained 

35 by the polypeptide. NH2 refers to the free amino 
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group present at the amino- or carboxy- terminus of a 
polypeptide. CO OH refers to the free carboxy group 
present at the carboxy terminus of a polypeptide. The 
amino-terminal NH 2 group and carboxy-terminal COOH 
5 group of free polypeptides are typically not set forth 
in a formula. A hyphen at the amino- or car boxy- 
terminus of a sequence indicates the presence of a 
further sequence of amino acid residues or a 
respective NH 2 or COOH terminal group. In keeping 
10 with standard polypeptide nomenclature, J. Biol. 

Chem. , 243:3552-59 (1969) and adopted at 37 CFR 
SI. 822 (b) (2) , abbreviations for amino acid residues 
are shown in the following Table of Correspondence: 

15 TABLE OF CORRESPONDENCE 

SYMBOL AMINO ACID 

l-Letter 3 -Letter 



y 


Tyr 


tyrosine 


G 


Gly 


glycine 


F 


Phe 


phenylalanine 


M 


Met 


methionine 


A 


Ala 


alanine 


S 


Ser 


serine 


I 


lie 


isoleucine 


L 


Leu 


leucine 


T 


Thr 


threonine 


V 


Val 


valine 


P 


Pro 


proline 


K 


Lys 


lysine 


H 


His 


histidine 


Q 


Gin 


glutamine 


E 


Glu 


glutamic acid 


W 


Trp 


tryptophan 


R 


Arg 


arginine 


D 


Asp 


aspartic acid 



N Asn asparagine 

C Cys cysteine 

It should be noted that all amino acid residue 
sequences are represented herein by formulae whose 
left and right orientation is in the conventional 
direction of amino-terminus to carboxy-terminus . 

Nucleotide : a monomer ic unit of DNA or RNA 
consisting of a sugar moiety (pentose) , a phosphate, 
and a nitrogenous heterocyclic base. The base is 
linked to the sugar moiety via the glycosidic carbon 
(l 1 carbon of the pentose) and that combination of 
base and sugar is a nucleoside. When the nucleoside 
contains a phosphate group bonded to the 3' or 5' 
position of the pentose it is referred to as a 
nucleotide. A sequence of operatively linked 
nucleotides is typically referred to herein as a "base 
sequence" or "nucleotide sequence" , and is represented 
herein by a formula whose left to right orientation is 
in the conventional direction of 5 •-terminus to 3'- 
terminus . 

Base Pair (bp) : A partnership of adenine (A) 
with thymine (T) , or of cytosine (C) with guanine (G) 
in a double stranded DNA molecule. 

B. Pfu DNA Polymerase I fPvro Polymerase! 

Pyro polymerase, the thermostable DNA polymerase 
of the present invention, can be obtained from any 
source and can be a native or recombinant protein. A 
preferred Pyro polymerase is isolated from Pvrococcus 
furiosus. P. furiosus is available from Dentsche 
Sammlung Von Microorganismen (DSM) , Grise-Bach StraSSE 
8, d-3400 GSttengen, FRG, under the accession number 
DSM-6217. 

For isolating the native protein from P. furiosus 
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cells, such cells are grown using any suitable 
technique. A variety of such techniques have been 
reported, those preferred being described by Fiala et 
al„ Arch, Microbiol , (1986) 145:56-61, and Bryant et 
5 al- . J- Biol. Chem. . (1989) 264:5070-5079, the 

disclosures of which are incorporated herein by 
reference. 

After cell growth, the isolation and purification 
of Pyro polymerase takes place in about 3 stages, each 
10 of which is performed at, and preferably below, room 
temperature, preferably about 4°C. 

In the first step, the cells are concentrated 
from the growth medium, typically by centrifugation or 
filtration. 

15 in the second step, the cells are lysed and the 

supernatant is segregated and recovered from the 
cellular debris. Lysis is typically accomplished by 
mechanically applying sheer stress and/ or enzymatic 
digestion. Segregation of the supernatant is usually 

20 accomplished by centrifugation. 

The third step removes nucleic acids and some 
protein. The supernatant from the second step is 
applied to an agarose resin strong anionic exchange 
column, such as Q-sepharose from Pharmacia 

25 (Piscataway, NJ) equilibriated with column buffer [50 
mM tris-hydroxymethylaminomethane (Tris) , pH 8.2, 10 
mM beta mercaphroethanol, and ImM 
ethylenediaminetetraacetic acid (EDTA) ] . The 
supernatant is washed through the column with the 

30 column buffer and the pass-through and washes are 
collected and centrifuged to remove any insoluble 
material. The supernatant is segregated, usually 
dialyzed, and then recovered to form a fraction 
containing partially purified Pyro polymerase. 

35 The fourth step removes substantially all (90%) 



WO 92/09689 PCT/US91/09026 

-13- 

of the remaining contaminating proteins and comprises 
applying the fraction recovered from step three to a 
phosphocellulose column equilibriated with the before 
described column buffer. The column is washed with 
5 the column buffer until the optical density of the 

wash eluate is at the buffer baseline at 280 nm. The 
immobilized Pyro polymerase is thereafter eluted with 
a linear salt gradient comprising O M to about 0.7 M 
salt dissolved in the column buffer, the salt being 
10 NaCI, KC1, and the like. Protein eluted from the 
column at about 200 mM salt typically contains the 
highest concentrations of assayable Pyro polymerase. 

In preferred embodiments, the Pyro polymerase 
preparation obtained from the fourth step is further 
15 purified in a fifth step by FPLC chromatography 

through a high performance cation exchange column, 
such as the Mono S column available from Pharmacia, 
Piscataway, NJ, equilbriated with the before described 
column buffer. After application, the column is 
20 washed to remove non-bound contaminants. The 

immobilized Pyro polymerase is then eluted with the 
before-described linear salt gradient at about 120 nM 
salt concentration. The Pyro polymerase eluate is 
then typically dialysed" against the column buffer to 
25 remove excess salt. A stabilizing agent, such as 

glycerol, can be added to the preparation at this time 
to facilitate low temperature storage. Typically, the 
fraction is again dialyzed against a low salt buffer, 
e.g., 50 mM Tris pH 7.5, 1 mM dithiothreitol, o.l mM 
30 EDTA, 0.1% Tween 20, and 0.1% non-idet P40. 

In further preferred embodiments the Pyro 
polymerase preparation of step five is applied to a 
crosslinked agarose affinity column, such as the Affi- 
Gel Blue column available from BioRad, Richmond, CA, 
35 equilibrated with the before-described column buffer. 
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Non-bound protein is washed from the column and the 
Pyro polymerase is eluted with the before-described 
salt gradient with the Pyro polymerase typically being 
recovered at about 280 mM salt concentration. 
5 Thereafter, the Pyro polymerase preparation is usually 
concentrated about 5-10 fold and dialysed against 
column buffer. Typically, a stabilizing agent, such 
as glycerol, lis added to the preparation to facilitate 
low temperature storage. 

X0 The amino-terminal amino acid residue sequence of 

Pyro polymerase can be determined by any suitable 
method, such as by automated Edman degradation, and 
the like. The amino acid residue sequence of a 
preferred Pyro polymerase is shown in SEQ ID NO 1 from 

15 residue 1 to 775. 

The molecular weight of the dialyzed product may 
be determined by any technique, for example, by sodium 
dodecylsulf ate-polyacrylamide gel electrophoresis 
(SDS-PAGE) using protein molecular weight markers. 

20 Native Pyro polymerase purified by the above method 
has a relative molecular weight, determined by SDS- 
PAGE under non-reducing conditions, of about 90,000- 
93,000 daltons. 

In preferred embodiments, Pyro polymerase is used 

25 in combination with a thermostable buffering agent 
such as TAPS (N-tris[Hydroxymethyl]methyl-3- 
aminopropanesulfonic acid; ([2-Hydroxy-l, 1- 
bis (hydroxy-methyl) -ethyl )amino-l-propanesulf onic 
acid), available from Sigma, St. Louis, MO (Catalog 

30 P7905) . 

C. Recombinant Pvro Polymerase 

Pyro polymerase can also be produced by 
recombinant DNA (rDNA) techniques, as the gene 
35 encoding the enzyme can be cloned from P, fuyiosus 
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genomic DNA. Thus, the present invention also 
contemplates a DNA segment consisting essentially of a 
sequence of nucleotide base sequence encoding a Pyro 
polymerase of this invention. An exemplary DNA 
5 sequence, obtained from the native gene, coding for a 
preferred Pfu I protein is shown in SEQ ID NO 2 from 
nucleotide base 224 to base 2548, which spans the 
coding portion of SEQ ID NO 2. 

The isolated gene can be operably linked to an 

10 expression system to form an rDNA capable of 

expressing, in a compatible host, Pyro polymerase. 

Of course, modifications to the primary structure 
itself by deletion, addition, or alteration of the 
amino acids incorporated into the protein sequence 

15 during translation can be made without destroying the 
activity of the protein. Such substitutions or other 
alterations result in proteins having an amino acid 
sequence encoded by DNA falling within the 
contemplated scope of the present invention. 

20 1. Cloning and Expression of the Pyro 

Polymerase Gene 

Polyclonal antiserum from rabbits 
immunized with the purified 90,000-93,000 dalton 
polymerase of this invention can be used to probe a P^ 

25 furiosus partial genomic expression library to obtain 
the appropriate coding sequence as described below. 
The cloned genomic sequence can be expressed as a 
fusion protein, expressed directly using its own 
control sequences, or expressed by constructions using 

30 control sequences appropriate to the particular host 
used for expression of the enzyme. 

Thus, the complete coding sequence for Pyro 
polymerase from which expression vectors applicable to 
a variety of host systems can be constructed and the 

35 coding sequence expressed. It is also evident from 
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the foregoing that portions of the Pyro polymerase- 
encoding sequence are useful as probes to retrieve 
other similar thermostable polymerase-encoding 
sequences in a variety of Archaebacteria species, 
5 particularly from other Pyrococcus species and P^ 
furiosus strains. Accordingly, portions of the 
genomic DNA encoding at least six contiguous amino 
•acids can be synthesized and used as probes to 
retrieve additional DNAs encoding an Archaebacteria 

10 thermostable polymerase. Because there may not be a 
precisely exact match between the nucleotide sequence 
in the P. furiosus form described herein and that in 
the corresponding portion of other species or strain, 
oligomers containing approximately 18 nucleotides 

15 (encoding the six amino acid stretch) are probably 

necessary to obtain hybridization under conditions of 
sufficient stringency to eliminate false positives. 
The sequences encoding six amino acids would supply 
information sufficient for such probes. 

20 Exemplary degenerate Pfu DNA polymerase I gene 

probes are shown in Table 1. 
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Table 1 

Degenerate Pfu DNA Polymerase I G ene Probes 1 
A 

T T T I 
5 1) 5 • GACTACATCACCGAIGAIGG3 >. 

T T A A 

2) S^CCTCCTCIGTIATGTAGTCS 1 

10 T A A T 

3) 5 1 TTCAAGAAGAACGG-3 1 

A T T A 

4 ) 5 • CCGTTCTTCTTGAA-3 • 



15 



Nucleotide sequence degenerate substitutions are 
bases directly above the positions in the sequence 
where the substitutions were made. In some cases, 
degeneracy was accomplished by substituting 
20 inosine (I) in the sequence. 

A preferred and exemplary cloning protocol for 
isolation of a pfu pol I gene is described in the 
Examples. From the clone pF72 described in the 

25 Examples, the nucleotide sequence of a preferred gene 
encoding pfu pol I was described and is shown in SEQ 
ID NO 2, and can be utilized for the production of 
recombinant Pyro polymerase. 

In general terms, the production of a recombinant 

30 form of Pyro polymerase typically involves the 
following: 

First, a DNA is obtained that encodes the mature 
(used here to include all muteins) enzyme or a fusion 
of the Pyro polymerase either tQ an additional 
35 sequence that does not destroy its activity, or to an 
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additional sequence cleavable under controlled 
conditions (such as treatment with peptidase) to give 
an active protein. If the sequence is uninterrupted 
by introns it is suitable for expression in any host. 
5 This sequence should be in an excisable and 
recoverable form. 

The excised or recovered coding, sequence is then 
preferably placed in operable linkage with suitable 
control sequences in a replicable expression vector. 

10 The vector is used to transform a suitable host and 
the transformed host cultured under favorable 
conditions to effect the production of the recombinant 
Pyro polymerase. Optionally the Pyro polymerase is 
isolated from the medium or from the cells; recovery 

15 and purification of the protein may not be necessary 
in some instances, where some impurities may be 
tolerated. 

Each of the foregoing steps can be done in a 
variety of ways* For example, the desired coding 

20 sequences may be obtained from genomic fragments and 

used directly in appropriate hosts. The constructions 
for expression vectors operable in a variety of hosts 
are made using appropriate replicons and control 
sequences, as set forth' below. Suitable restriction 

25 sites can, if not normally available, be added to the 
ends of the coding sequence so as to provide an 
excisable gene to insert into these vectors. 

The control sequences, expression vectors, and 
transformation methods are dependent on the type of 

30 host cell used to express the gene. Generally, 

procaryotic, yeast, insect or mammalian cells are 
presently useful as hosts. Procaryotic hosts are in 
general the most efficient and convenient for the 
production of recombinant proteins and therefore are 

35 preferred for the expression of Pyro polymerase. 
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2. Control Sequences and Corresponding 
dosts 

Procaryotes most frequently are 
represented by various strains of E. coli . However, 
5 other microbial strains may also be used, such as 
bacilli, for example, Bacillus subtillis, various 
species of Pseudomonas, or other bacterial strains. 
In such procaryotic systems, plasmid vectors that 
contain replication sites and control sequences 

10 derived from species compatible with the host are 

used. For example, E. coli is typically transformed 
using derivatives of pBR322, a plasmid derived from an 
E, coli species by Bolivar, et al., Gene . (1977) 2:95 
and Sutcliffe, Nuc. Acids Res. . (1978) 5:2721-28. 

15 pBR322 contains genes for ampicillin and tetracycline 
resistance, and thus provides additional markers that 
can be either retained or destroyed in constructing 
the desired vector. Commonly used procaryotic control 
sequences, which are defined herein to include 

20 promoters for transcription initiation, optionally 
with an operator, along with ribosome binding site 
sequences, include such commonly used promoters as the 
B-lactamase (penicillinase) and lactose (lac) promoter 
systems (Chang, et al., Nature . (1977) 198:1056), the 

25 tryptophan (trp) promoter system (Goeddel, et al., 
Nucleic Acids Res. . (1980) 8:4057) and the lambda- 
derived P L promoter (Shimatake, et al. , Nature r (1981) 
292:128) and N-gene ribosome binding site, which has 
been made useful as a portable control cassette (as 

30 set forth in U.S. Patent No. 4,711,845), which 

comprises a first DNA sequence that is the P L promoter 
operably linked to a stream of a third DNA sequence 
having at least one restriction site that permits 
cleavage with six bp 3* of the N RBS sequence. Also 

35 useful is the phosphatase A (phoA) system described by 
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Change, et al. in European Patent Publication No. 
196,864. However, any available promoter system 
compatible with procaryotes can be used. Typical 
bacterial plasmids are pUC8, pUC9, pBR322 and pBR329 
5 available from Bio-Rad Laboratories, (Richmond, CA) 
and pPL and pkk233-2, available from Pharmacia 
(Piscataway, NJ) or Clone Tech (Palo Alto, CA) . 

In addition to bacteria, eucary otic microbes, 
such as yeast, may also be used as hosts. Laboratory 

10 strains of Saccharomvces cerevisiae. Baker's yeast, 

are most used, although a number of other strains are 
commonly available. While vectors employing the 2 
micron origin, of replication are illustrated (Broach, 
J.R., Meth. Enz. . (1983) 101:307), other plasmid 

15 vectors suitable for yeast expression are known (see, 
for example, Stinchcomb, et al., Nature, (1979) 
282:39, Tschempe, et al., Geng, (1980) 10:157, Clarke, 
L., et al., Meth. Enz (1983) 101:300), Brake et al., 
Proc Natl. Aead, Sci. USA. (1984) 81:4642-4647, and 

20 Halewell et al.. Biotechnology. (1987) 5:363-366. 

Control sequences for yeast vectors include promoters 
for the synthesis of glycolytic enzymes (Hess, et al., 
J. Adv. Enzvme Recr, . (1968) 7:149; Holland, et al. , 
Biotechnology (1978) 17:4900). 

25 Additional promoters known in the art include the 

promoter for 3-phosphoglycerate kinase (Hitzeman, et 
al., J- Biol. Chem. . (1980) 255:2073) and those for 
other glycolytic enzymes, such as glyceraldehyde-3- 
phosphate dehydrogenase, hexokinase, pyruvate 

30 decarboxylase, phosphofructokinase, glucose-6- 
phosphate isomerase, 3-phosphoglycerate mutase, 
pyruvate kinase, triosephosphate isomerase, 
phosphoglucose isomerase, and glucokinase. Other 
promoters that have the additional advantage of 

35 transcription controlled by growth conditions are the 
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promoter regions for alcohol dehydrogenase 2, 
iscoyto^hrome C, acid phosphatase, degradative enzymes 
associated with nitrogen metabolism, and enzymes 
responsible for maltose and galactose utilization 
5 (Holland, supra) . 

It is also believed that terminator sequences are 
desirable at the 3* end of the coding sequences. Such 
terminators are found in the 3 9 untranslated region 
following the coding sequences in yeast-derived genes. 

10 Many of the vectors illustrated contain control 

sequences derived from the enolase gene containing 
plasmid peno46 (Holland, M. M. , et al., J. Biol Chem. , 
(1981) 256:1385) or the LEU2 gene obtained from YEpl3 
(Broach, J. , et al., Gene . (1978) 8:21); however, any 

15 vector containing a yeast-compatible promoter, origin 
of replication, and other control sequences is 
suitable. 

It is also, of course, possible to express genes 
encoding polypeptides in eucaryotic host cell cultures 

20 derived from multicellular organisms. See, for 

example, Tissue Culture . Academic Press, Cruz and 
Patterson, editors (1973) . Useful host cell lines 
include murine myelomas N51, VERO and HeLA cells, and 
Chinese hamster ovary (CHO) cells available from the 

25 ATCC as CCL61, and NIH/3T3 mouse cells available from 
the ATCC as CRL1658. Expression vectors for such 
cells ordinarily include promoters and control 
sequences compatible with mammalian cells such as, for 
example, the commonly used early and late promoters 

30 from Simian Virus 40 (SV 40) (Fiers, et al., Nature, 
(1978) 273:113), or other viral promoters such as 
those derived from polyoma, Adenovirus 2, bovine 
papilloma virus, or avian sarcoma viruses, or 
immunoglobulin promoters and heat shock promoters. A 

35 system for expressing DNA in mammalian systems using 



the BPV as a vector is disclosed in U.S. Patent No. 
4, 419 , 446. A modification of this system is described 
in U.S. Patent No. 4,601,978. General aspects of 
mammalian cell host system transformations have been 
described in U.S. Patent No. 4,399,216. It now 
appears, also, that "enhancer" regions are important 
in optimizing expression; these sure, generally, 
sequences found upstream of the promoter region. 
Origins of replication may be obtained, if needed, 
from viral sources. However, integration into the 
chromosome is a common mechanism for DNA replication 
in eucaryotes. 

Plant cells are also now available as hosts, and 
control sequences compatible with plant cells such as 
the nopaline synthase promoter and polyadenylation 
signal sequences (Depicker, A. , et al. , J. Mol. Appl. 
Gen. . (1982) 1:561) are available. See, also, U.S. 
Patents No. 4,962,028, No. 4,956,282, No. 4,886,753 
and NO. 4,801,540. 

Recently, in addition, expression systems 
employing insect cells utilizing the control systems 
provided by baculovirus vectors have been described 
(Miller, D.W. , et al., in Genetic Engineering (1986) 
Setlow, J.K. et al., eds., Plenum Publishing, Vol. 8, 
pp. 277-297). See, also, U.S. Patents No. 4,745,051 
and No. 4,879,236. These systems are also successful 
in producing Pyro polymerase. 

A preferred DNA segment containing both the pf u 
pol I coding portion and control sequences at the 5' 
and 3 1 termini of the coding portion is shown in SEQ 
ID NO 2 from nucleotide base 1 to base 3499. 
3. Transformations 

The recombinant DNA molecules of the 
present invention are introduced into host cells, via 
a procedure commonly known as transformation or 



transfection. Transformation of appropriate host 
cells with a recombinant DNA molecule of the present 
invention is accomplished by well known methods that 
typically depend on the type of vector used. With 
regard to transformation of procaryotic host cells or 
other cells that contain substantial cell wall 
barriers, see, for example, Cohen et al., Proc. Natl. 
Acad. Sci. USA . 69:2110 (1972); and Maniatis et al., 
Molecular Clonino. A Laboratory Manual. Cold Spring 
Harbor Laboratory, Cold Spring Harbor, NY (1982) . 
With regard to transformation of vertebrate cells with 
retroviral vectors containing rDNA, see, for example, 
Sorge et al., Mol. Cell. Biol. . 4:1730-37 (1984); and 
Wigler et al., Proc. Natl. Acad. Sci. USA . 76:1373-76 
(1979). 

Infection with Aarobacterium tumefaciens (Shaw, 
C.H., et al., Gene . (1983) 23:315) is used for certain 
plant cells. For mammalian cells without cell walls, 
the calcium phosphate precipitation method of Graham 
and van der Eb, Virology (1978) 52:546 is preferred. 
Transformations into yeast are carried out according 
to the method of Van Solingen, P., et al., J. Bact. 
(1977) 130:946 and Hsiao, C.L., et al., Proc. Natlf ~ 
Acad. Sci. (USA) . (1979) 76:3829. 

Successfully transformed cells, i.e., cells that 
contain a recombinant DNA (rDNA) molecule of the 
present invention, are usually monitored by an 
appropriate immunological, hybridization or functional 
assay. For example, cells resulting from the 
introduction of an rDNA of the present invention can 
be cloned to produce monoclonal colonies. Cells from 
those colonies can be harvested, lysed and their DNA 
content examined for the presence of the rDNA using a 
method such as that described by Southern, J. Mol. 
Biol . . 98:503 (1975) or Berent et al., Biotech. . 3:208 
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(1985) . 

In addition to directly assaying for the presence 
of rDNA, successful transformation can be confirmed by 
well known immunological methods when the rDNA is 
5 capable of directing the expression of Pyro 
polymerase. For example, cells successfully 
transformed with a subject rDNA containing an 
expression vector produce a polypeptide displaying a 
characteristic antigenicity. Samples of a culture 

10 containing cells suspected of being transformed are 

harvested and assayed for a subject polypeptide (Pyro 
polymerase) using antibodies specific for that 
polypeptide antigen, such as those produced by an 
appropriate hybridoma. 

X5 A particularly convenient assay technique 

involves fusing the Pyro polymerase-encoding DNA to a 
Lac Z gene in a suitable plasmid, e.g. pLG. Since the 
plasmid lacks a promoter and Shine-Da lgarno sequence, 
no 6-galactosidase is synthesized. However, when a 

20 portable promoter fragment is properly positioned in 
front of the fused gene, high levels of a fusion 
protein having 6-galactosidase activity should be 
expressed. The plasmids sure used to transform Lac- 
bacteria which are scored for B-galactosidase activity 

25 on lactose indicator plates. Plasmids having 

optimally placed promoter fragments are thereby 
recognized. These plasmids can then be used to 
reconstitute the fusion protein gene which is 
expressed at high levels. 

30 Thus, in addition to the transformed host cells 

themselves, cultures of the cells are contemplated as 
within the present invention. The cultures include 
monoclonal (clonally homogeneous) cultures, or 
cultures derived from a monoclonal culture, in a 

35 nutrient medium. Nutrient media useful for culturing 
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t r an s formed host cells are well known in the art and 
can be obtained from several commercial sources. In 
embodiments wherein the host cell is mammalian, a 
"serum-free" medium is preferably used. 
5 The present method entails culturing a nutrient 

medium containing host cells transformed with a 
recombinant DNA molecule of the present invention that 
is capable of expressing a gene encoding a subject 
polypeptide. The culture is maintained for a time 

10 period sufficient for the transformed cells to express 
the subject polypeptide. The expressed polypeptide is 
then recovered from the culture. 

Once a gene has been expressed in high levels, a 
DNA fragment containing the entire expression 

15 assembly, e.g., promoter, ribosome-binding site, and 
fusion protein gene) may be transferred to a plasmid 
that can attain very high copy numbers. For instance, 
the temperature- inducible "runaway replication" vector 
pKN402 may be used. Preferably, the plasmid selected 

20 will have additional cloning sites which allow one to 
score for insertion of the gene assembly. See, 
Bittner et al. Gene . 15:31 (1981). Bacterial cultures 
transformed with the plasmids are grown for a few 
hours to increase plasmid copy number, e.g., to more 

25 than 1000 copies per cell. Induction may be performed 
in some cases by elevated temperature and in other 
cases by addition of an inactivating agent to a 
repressor. Potentially very large increases in cloned 
fusion proteins can be obtained in this way. 

30 4. Construction of a Lambda Expression 

Library 

The strategy for isolating DNA encoding 
desired proteins such as the Pyro polymerase encoding 
DNA, using the bacteriophage vector lambda gtll, is as 
35 follows. A library can be constructed of EcoHI- 



flanked Alul fragments, generated by complete 
digestion of p. furiosus DNA. inserted at the EcoRI 
site in the lambda gtll phage (Young and Davis, Proc. 
Natl. Acad, Sci. fUSAK f!983) 80:1194-1198). Because 
the unique EcoRI site in this bacteriophage is located 
in the carboxyl-terminus of the B-galactosidase gene, 
inserted DNA (in the appropriate frame and 
orientation) is expressed as protein fused with B- 
galactosidase under the control of the lactose operon 
prompter / operator . 

Genomic expression libraries are then screened 
using the antibody plaque hybridization procedure. A 
modification of this procedure, referred to as 
"epitope selection," uses antiserum against the fusion 
protein sequence encoded by the phage, to confirm the 
identification of hybridized plaques. Thus, this 
library of recombinant phages could be screened with 
antibodies that recognize the 90,000-93,000 dalton 
Pyro polymerase in order to identify phage that carry 
DNA segments encoding the antigenic determinants of 
the Pyro polymerase protein. 

Approximately 2 X 10 5 recombinant phage are 
screened using rabbit Pyro polymerase antiserum. In 
this primary screen, positive signals are detected and 
one or more of these plaques are purified from 
candidate plaques which failed to react with preimmune 
serum and reacted with immune serum and analyzed in 
some detail. Anti-Pyro polymerase antibodies can be 
prepared by a number of known methods, see, for 
example, U.S. Patent No. 4,082,735, No. 4,082,736, and 
No. 4,493,795. 

To examine the fusion proteins produced by the 
recombinant phage, lysogens of the phage in the host 
Y1089 are produced. Upon induction of the lysogens 
and gel electrophoresis of the resulting proteins, 



each lysogen may be observed to produce a new protein, 
not found in the other lysogens, or duplicate 
sequences may result. Phage containing positive 
signals are picked. Typically, one positive plaque is 
picked for further identification and replated at 
lower densities to purify recombinants and the 
purified clones are analyzed by size class via 
digestion with EcoRI restriction enzyme. Probes can 
then be made of the isolated DNA insert sequences and 
labeled appropriately and these probes can be used in 
conventional colony or plaque hybridization assays 
described in Maniatis et al . # Molecular Cloning; A 
Laboratory Manual . (1982), the disclosure of which is 
incorporated herein by reference. 

5. Recombinant DNA Molecules 

The present invention further 
contemplates a recombinant DNA (rDNA) that includes a 
Pyro polymerase-encoding DNA segment of the present 
invention operatively linked to a vector for 
replication and/or expression. Preferred rDNA 
molecules contain less than 50,000 nucleotide base 
pairs, usually less than 20,000 base pairs and 
preferably less than about 10,000 base pairs. 
Preferably, a Pyro polymerase-encoding DNA of this 
invention is in the form of a plasmid, cosmid or 
phage • 

A preferred rDNA molecule includes a nucleotide 
sequence shown in SEQ ID NO 2 from nucleotide base 224 
to base 2548. 

A rDNA molecule of the present invention can be 
produced by operatively linking a vector to a DNA 
segment of the present invention. 

As used herein, the term "vector" refers to a 
nucleic acid molecule capable of transporting between 
different genetic environments another nucleic acid to 
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which it has been operatively linked Preferred 
vectors are those capable of autonomous replication 
and/or expression of nucleic acids to which they are 
operatively linked are referred to herein as 
5 "expression vectors" . As used herein, the term 

"operatively linked" , in reference to DNA segments, 
describes that the nucleotide sequence is joined to 
the vector so that the sequence is under the 
transcriptional and translation control of the 
10 expression vector and can be expressed in a suitable 
host cell. 

As is well known in the art, the choice of vector 
to which a protein encoding DNA segment of the present 
invention is operatively linked depends upon the 

15 functional properties desired, e.g., protein 

expression, and upon the host cell to be transformed. 
These limitations are inherent in the art of 
constructing recombinant DNA molecules. However, a 
vector contemplated by the present invention is at 

20 least capable of directing the replication, and 

preferably also expression, of a gene operatively 
linked to the vector. 

In preferred embodiments, a vector contemplated 
by the present invention includes a procaryotic 

25 replicon, i.e., a DNA sequence having the ability to 
direct autonomous replication and maintenance of the 
recombinant DNA molecule extrachromosomally in a 
procaryotic host cell, such as a bacterial host cell, 
transformed therewith. Such replicons are well known 

30 in the art. In addition, those embodiments that 

include a procaryotic replicon may also include a gene 
whose expression confers a selective advantage such as 
amino acid nutrient dependency or drug resistance to a 
bacterial host transformed therewith as is well known, 

35 in order to allow selection of transformed clones. 



Typical bacterial drug resistance genes are those that 
confer resistance to ampicillin, tetracycline , or 
kanamycin. 

Those vectors that include a procaryotic replicon 
may also include a procaryotic promoter capable of 
directing the expression (transcription and 
translation) of the gene transformed therewith. A 
promoter is an expression control element formed by a 
DNA sequence that permits binding of RNA polymerase 
and transcription to occur. Promoter sequences 
compatible with bacterial hosts are typically provided 
in plasmid vectors containing convenient restriction 
sites for insertion of a DNA segment of the present 
invention. Bacterial expression systems , and choice 
and use of vectors in those systems is described in 
detail in "Gene Expression Technology 11 , Meth. 
Enzymol. . Vol 185, Goeddel, Ed., Academic Press, NY 
(1990). 

Expression vectors compatible with eucaryotic 
cells, preferably those compatible with vertebrate 
cells, can also be used to form the recombinant DNA 
molecules of the present invention. Eucaryotic cell 
expression vectors are well known in the art and are 
available from several commercial sources. Typically, 
such vectors are provided containing convenient 
restriction sites for insertion of the desired gene. 
Typical of such vectors are pSVL and pKSV-10 
(Pharmacia) , pBPV-l/pML2d (International 
Biotechnologies, Inc.), and pTDTl (ATCC, #31255). 

In preferred embodiments, the eucaryotic cell 
expression vectors used to construct the recombinant 
DNA molecules of the present invention include a 
selectable phenotypic marker that is effective in a 
eucaryotic cell, such as a drug resistance selection 
marker or selective marker based on nutrient 
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dependency. A preferred drug resistance marker is the 
gene whose expression results in neomycin resistance, 
i.e.., the neomycin phosphotransferase (neo) gene. 
Southern et al., J. Mol. AppI. Genet. , 1:327-341 

5 (1982) . 

The use of retroviral expression vectors to form 
the rDNAs of the present invention is also 
contemplated. As used herein, the term "retroviral 
expression vector" refers to a DNA molecule that 

10 includes a promoter sequence derived from the long 
terminal repeat (LTR) region of a retrovirus genome. 

In addition to using strong promoter sequences to 
generate large quantities of mRNA coding for the 
expressed fusion proteins of the present invention, it 

15 is desirable to provide ribbsome-binding sites in the 
mRNA to ensure efficient translation. The ribosome- 
binding site in E. coli includes an initiation codon 
(AUG) and a sequence 3-9 nucleotides long located 3-11 
nucleotides upstream from the initiation codon (the 

20 Shine-Dalgarno sequence). See, Shine et al., Nature, 
254:34 (1975). Methods for including a ribosome- 
binding site in mRNAs corresponding to the expressed 
proteins are described by Haniatis, et al. Molecular 
Cloning: A Laboratory Manual , Cold Spring Harbor 

25 Laboratory Press, NY pp. 412-417 (1982). Ribosome 
binding sites can be modified to produce optimum 
configuration relative to the structural gene for 
maximal expression of the structural gene. Halewell 
etal., Acid. Res.. (1985) 13:2017-2034. 

30 Construction of suitable vectors containing the 

desired coding and control sequences employs standard s 
ligation and restriction techniques that are well 
understood in the art. Isolated plasmids, DNA 
sequences, or synthesized oligonucleotides are 

35 cleaved, tailored, and religated in the form desired. 
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Site-specif ic DNA cleavage is performed by 
treating with the suitable restriction enzyme (or 
enzymes) under conditions that are generally 
understood in the art, and the particulars of which 
5 are specified by the manufacturer of these 

commercially available restriction enzymes. See e.g., 
New England Biolabs, Product Catalog. In general, 
complete digestion is obtained by admixing about l ug 
of plasmid or DNA sequence with one unit of enzyme in 

10 about 20 ml of buffer solution. Incubation times of 
about one hour to two hours at about 37 °C are 
workable, although variations can be tolerated. After 
each incubation, protein is removed by extraction with 
phenol/chloroform, and may be followed by ether 

15 extraction, and the nucleic acid recovered from 

aqueous fractions by precipitation with ethanol. If 
desired, size separations is found in Methods in 
Enzymology (1980) 65:499-560. 

Restriction-cleaved fragments may be blunt-ended 

20 by treating with large fragment E. coli DNA polymerase 
I (Klenow) in the presence of the four deoxynucleotide 
triphosphates (dNTPs) using incubation times of about 
15 to 25 minutes at 20 to 25°C in 50 mM Tris pH 7.6, 
50 mM NaCl, 10 mM MgCl 2 , 10 mM DTT and 50-100 MM 

25 dNTPs. The Klenow fragment fills in at 5 1 sticky 

ends, but chews back protruding 3 9 single strands even 
though the four dNTPs are present. If desired, 
selective repair can be performed by supplying only 
one of the, or selected, dNTPs within the limitations 

30 dictated by the nature of the sticky ends. After 

treatment with Klenow, the mixture is extracted with 
phenol /chloroform and ethanol precipitated. Treatment 
under appropriate conditions with SI nuclease results 
in hydrolysis of any single-stranded portion. 

35 Synthetic oligonucleotides may be prepared using 



the triester method of Matteucci, et al., ( J. Am. 
Chem. Soc. , (1981) 103:3185-3191) or using automated 
synthesis methods. Kinasing of single strands prior 
to annealing or for labeling is achieved using an 
excess , e.g., approximately 10 units of polynucleotide 
kinase to 1 nM substrate in the presence of 50 mM 
Tris, pH 7.6, 10 mN MgCl 2 , 5 mM dithiothreitol, 1-2 mM 
ATP. If kinasing is for labeling of probe, the ATP 
will contain high specific activity 32 P. 

Ligations are performed in 15-30 Ml volumes under 
the following standard conditions and temperatures: 20 
mM Tris-Cl pH 7.5, lOmM MgCl E , 10 mM DTT, 33 fig/inl 
BSA, 10 mM-50 mM NaCl, and either 40 ATP, 0.01-0.02 
(Weiss) units TA DNA ligase at 0°C (for "sticky end" 
ligation) or 1 mM ATP, 0.3-0.6 (Weiss) units T4 DNA 
ligase at 14 °C (for "blunt end" ligation) . 
Intermolecular "sticky end" ligations are usually 
performed at 33-100 pg/ml total DNA concentrations (5- 
100 nM total end concentration) • Intermolecular blunt 
end ligations (usually employing a 10-30 fold molar 
excess of linkers) are performed at 1 pM total ends 
concentration . 

In vector construction employing "vector 
fragments", the vector fragment is commonly treated 
with bacterial alkaline phosphatase (BAP) in order to 
remove the 5 1 phosphate and prevent religation of the 
vector. BAP digestions are conducted at PH 8 in 
approximately 150 mM Tris, in the presence of Na + and 
Mg* 2 using about 1 unit of BAP per mg of vector at 
60 °C for about one hour. In order to recover the 
nucleic acid fragments, the preparation is extracted 
with phenol/ chloroform and ethanol precipitated. 
Alternatively, religation can be prevented in vectors 
that have been double digested by additional 
restriction enzyme digestion of the unwanted 
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fragments. 

For portions of vectors derived from cDNA or 
genomic DNA that require sequence modifications, sit- 
specif ic primer-directed mutagenesis is used, this 
5 technique is now standard in the art, and is conducted 
using a primer synthetic oligonucleotide complementary 
to a single-stranded phage DNA to be mutagenized 
except for limited mismatching, representing the 
desired mutation. Briefly, the synthetic 

10 oligonucleotide is used as a primer to direct 

synthesis of a strand complementary to the phage, and 
the resulting double-stranded DNA is transformed into 
a phage-supporting host bacterium. Cultures of the 
transformed bacteria are plated in top agar, 

15 permitting plaque formation from single cells that 
harbor the phage. 

Theoretically, 50% of the new plaques will 
contain the phage having, as a single strand, the 
mutated form; 50% will have the original sequence. 

20 The plaques are transferred to nitrocellulose filters 
and the "lifts 1 * hybridized with kinased synthetic 
primer at a temperature that permits hybridization of 
an exact match, but at which the mismatches with the 
original strand are sufficient to prevent 

25 hybridization. Plaques that hybridize with the probe 
are then picked and cultured, and the DNA is 
recovered. 

6. Verification of Co nstruction 
Correct ligations for plasmid 
30 construction are confirmed by first transforming E. 

coli strain MM294, or other suitable host, with the 
ligation mixture. Successful transformants are 
selected by ampicillin, tetracycline or other 
antibiotic resistance or using other markers, 
35 depending on the mode of plasmid construction, as is 
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understood in the art. Plasmids from the 
transf ormants are then prepared according to the 
method of Clewell, D.B., et al. , Proc. Natl. Acad. 
sci, (USA) . (1969) 62:1159, optionally following 
5 chloramphenicol amplication (Clewell, D.B., Jj_ 

Bacfceriol. . (1972), 110:667). The isolated DNA is 
analyzed by restriction digest mapping and/ or 
sequenced by the dideoxy method of Sanger, F. , et al., 
Proc. Natl- Acad. Sci. fUSAK (1977) 74:5463 as 
10 further described by Messing, et al., Nucleic Acids 

Res. . (1981) 9:309, or by the method of Maxam, et al.. 
Methods in Enzvmoloav, (1980) 65:499. 

Host strains useful in cloning and expression are 
as follows: 

15 For cloning and sequencing, and for expression of 

constructions under control of most bacterial 
promoters, E. coli strain MM294 obtained from E- coli 
Genetic Stock Center GCSC #6135, is particularly 
useful. For expression under control of the P^^s 

20 promoter, E. coli strain K12 MC1000 lambda lysogen, 
ti^np 1 * 57 SusP eo (ATCC 39531), may be used. Also 
useful is E. coli DG116, (ATCC 53606). 

For M13 phage recombinants, E. coli strains 
susceptible to phage infection, such as E. coli K12 

25 strain DG98, are employed. The DG98 strain has been 
deposited with ATCC July 13, 1984 and has accession 
number 39768. 

The thermostable enzyme of this invention may be 
used for any purpose in which such enzyme is necessary 

30 or desirable. In a particularly preferred embodiment, 
the enzyme herein is employed in the amplification 
protocol set forth below. 
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Examples 

The following examples are intended to 
illustrate, but not limit, the present invention. 

5 1. Culturina of Pyrococcus furibsus and Preparation 

of Pf Cell Paste 

The following describes how the hyperthermophilic 
archaebacterium, P. furiosus . is routinely grown in a 
500 liter fermentor for the purpose of obtaining cell 

10 mass in sufficient quantities for large scale protein 
purification. It is a modified version [Bryant et 
al., J. Biol. Chem. . 264:5070-5079 (1989)] of the 
original protocol of Fiala et al., Arch . Microbiol , r 
145:56-61 (1986) . 

15 For culture maintenance, P. furiosus (DSM 3638) 

is routinely grown at 85-88 °C as a closed static 
culture in 100 ml of the medium described in Table 2. 

Table 2 



20 Maltose 5 g/1 

NH 4 C1 1.25 g/1 

Elemental Sulfur 5 g/1 

Na 2 S 0.5 g/1 
Synthetic Sea Water 1 

25 Vitamin mixture 2 1 ml/1 

FeCl 3 25 MM 

Na 2 W0 4 10 MM 

Yeast Extract 0.01% 



30 1. Synthetic Sea Water: 

NaCl, 13.8 g/1 
MgS0 4 , 3.5 g/1 
MgCl 2 , 2.7 g/1 
KC1, 0.3 g/1 

35 CaCl 2 , 0.75 g/1 
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KH 2 P0 4/ 0.5 g/1 

NaBr, 0.0-5 g/1 

KI, 0.05 g/1 

H3BO3, 0.015 g/1 
5 Sodium citrate, 0.005 g/1 

2. Vitamin mixture [Balch et al., Microbiol. Rev. , 
43:260-296 (1979)]: 

Biotin, 2 mg/1 

Folic acid, 2 mg/1 
10 Pyridoxine hydrochloride, 10 mg/1 

Thiamine hydrochloride, 5 mg/1 

Riboflavin, 5 mg/1 

Nicotinic acid, 5 mg/1 

DL-Calcium pantothenate, 5 mg/1 
15 Vitamin B 12 , 0.1 mg/1 

p-Aminobenzoic acid, 5 mg/1 

Lipoic acid, 5 mg/1 

Growth is monitored by the increase in turbidity 

20 at 600 nm. Cells can be stored in the same medium at 
4°C and remain viable for at least a year, although 
periodic transfer is recommended. 

Large scale (preparative) growth of p. furjosus 
was performed as follows: 

25 Growth medium according to Table 1, was prepared, 

except that the sulfide was replaced with titanium 
(III) nitrilotriacetate [final concentration, 30 nK as 
described in Moench et al., J- Microbiol. Meth., 
1:199-202 (1983)] and the elemental sulfur is omitted. 

30 The medium was then sparged with Argon (Ar) . 

A two liter flask was inoculated with two 100 ml 
cultures . The two liter culture was used as an 
inoculum for a 20 liter culture. Two 20 liter 
cultures were used to inoculate a 500 liter culture. 

35 The culture was maintained at 88 °C, bubbled with Ar 
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(7.5 liters/min) and stirred at about 50 rpm. After 
about 20 hours (A m ~ °* 5 ) the cells were harvested 
with a Sharpies continuous flow centrifuge at 100 
liters/hour. The cells were frozen in liquid N2 
5 immediately after harvesting. The yield of cells is 
typically 400-600g wet weight. 

It should be noted that P. furiosus has a 
fermentative type of metabolism and produces organic 
acids, C0 2 and H 2 as final products. H 2 production 

10 inhibits growth, so cultures have to be sparged with 
Ar (or any inert gas) to remove H 2 . Alternatively, 
elemental sulfur may be added. ' In this case, the 
reductant that would otherwise be used to generate H 2 
is used to reduce elemental sulfur to H 2 S. The 

15 addition of elemental sulfur is convenient for small 
scale cultures in glass vessels, but its reduction 
cannot be used to remove inhibitory H 2 in 500 liter 
stainless steel ferment or s because of the corrosive 
nature of H 2 S. 

20 

2. Purification of Pf DNA Polymerase I 
A. Lvsis of Pf Cell Paste 

Fifty grams (g) of Pf cell paste prepared in 
Example 1 were thawed at room temperatur e . Two 

25 hundred milliliters (ml) of lysis buffer consisting of 
50 millimolar (mM) Tris-HCl, pH 8.2, 10 mM beta 
mercaptoethanol, 1 mM EDTA and 200 microgram/ml 
(/ig/ml) of lysozyme were admixed to the thawed cell 
paste. The admixture was thereafter maintained for 30 

30 minutes at room temperature. The maintained admixture 
was processed in a French press for two cycles. The 
cell lysate was sonicated for 10 minutes at room 
temperature and centrifuged at 16,000 RPM in a SA600 
rotor for 60 minutes at room temperature and the 

35 supernatant recovered. 
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B. Column Chromatography of Pf Cell Lvsate 

The supernatant prepared above was loaded on 
to a Q-sepharose (2.5 X 40 centimeter) column at room 
temperature. The column containing the cell lysate 
5 supernatant was then washed with 200 ml of column 
buffer (50 mM Tris-HCl, pH 8.2, 10 mM beta 
mercaptoethanol and 1 mM EDTA) . The column pass 
through and the washes were collected, pooled, and 
then centrifuged at 9000 X g in a Sorvall GS3 rotor at 

10 room temperature to remove any insoluble material. 

The resulting supernatant, containing partially 
purified Pyro polymerase, was recovered from the 
pellet and loaded directly onto a phosphocellulose 
column (2.5 X 40 cm) at room temperature. The column 

15 was washed with column buffer to remove any proteins 
that did not bind to the column until the optical 
density measured at an absorbance of 280 run dropped to 
baseline. The immobilized Pyro polymerase was 
thereafter eluted with a one liter linear gradient of 

20 NaCl ranging in concentration from 0 M to 0.7 M 

dissolved in column buffer and 10 ml fractions were 
collected. 

C. Assay for Pfu DNA Polymerase I Activity 
The collected fractions were separately 

25 assayed for Pyro polymerase activity. The following 
reagents were admixed to form a reaction cocktail for 
the measurement of Pyro polymerase activity; final 
concentrations (fc) of the reagents in the cocktail 
are in parentheses: 

30 (1) 200 microliters (Ml) active calf thymus 

DNA, 575 Ml distilled water, 20 Ml 1 M Tris-HCl, pH 
7.5 (fc=20 mM) ; 8 jil 1 K MgCl 2 (fc=8 mM) ; 

(2) 10 Ml 0*75 M DTT (fc=7.5 mM) ; 4 Ml 15 
mg/ml BSA (fc=50 ug/ml) ; 

35 (3) 30 Ml 1° each of dATP, dCTP, dGTP 
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dTTP (fc=0.15 mM for each); and 

(4) 50 Ml 3 H-TTP in ethanol 432A (Amersham 
Inc., Arlington Heights, IL) for a total volume of 1 
ml. 

5 To perform the Pyro polymerase (DNA Polymerase I) 

activity assay, 25 ul of the reaction cocktail formed 
above was admixed with 1 to 5 ul of each collected 
fraction. The admixture was maintained for 10 to 60 
minutes at 75 °C which was the optimal temperature for 

10 enzymatic activity to form a labelled DNA admixture. 

After the maintenance period, 2 to 5 ul of the labeled 
DNA admixture was pipetted onto DE81 (Whattman) filter 
paper. The filter paper containing the labeled DNA 
admixture was dried on aluminum foil under a heat lamp 

15 for 5 minutes. After the drying period, each filter 
was washed three times for 5 minutes with 50 ml of 2X 
SSC (0.3 M NaCl, 0.03 H NaCitrate) followed by one 
quick wash with 100% cold ethanol. The washed filters 
were immediately placed fresh aluminum foil and placed 

20 under a heat lamp for 5 minutes to dry the filters. 
The dried filters were separately placed in 
scintillation vials containing 5 ml scintillation 
fluid. 

The 3 H-DNA immobilized on the filter paper, which 
25 reflects Pyro polymerase activity, was measured in a 
scintillation counter. The results of this assay 
indicated that the peak fractions from the 
phosphocellulose column containing the highest 
concentration of Pyro polymerase were eluted with 200 
30 mM NaCl and that Pyro polymerase constituted about 10% 
of the total protein present in those fractions. 
D. FPLC Purification of Pvro Polymerase 

The fractions containing approximately 90% 
of the total DNA polymerase I activity as measured 
35 above were pooled and dialyzed against column buffer 
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overnight at 4°C to form a NaCl-free Pyro polymerase 
solution. The dialyzed salt-free Pyro polymerase 
solution was loaded onto a Mono S HR 5/5 FPLC (fast 
phase liquid chromatography) column (Pharmacia, 
5 Piscataway, NJ) previously equilibrated with the 

before-described column buffer. The Mono S column 
containing the Pyro polymerase was washed with about 
four column volumes of column buffer prior to elution 
to remove any proteins that did not bind to the 
10 column. The immobilized proteins were eluted with a 
one liter linear gradient of NaCl ranging in 
concentration from 0.0 M to 0.7 M dissolved in column 
buffer. 

Fractions were collected and assayed for the 

15 presence of FPLC purified Pyro polymerase activity as 
described , respectively, in B and C above. The 
results of this assay indicated that the peak 
fractions from the Mono S column containing the 
highest concentration of FPLC purif ied Pyro polymerase 

20 were eluted with 120 mM NaCl. The fractions 
containing 90% of the peak FPLC purified Pyro 
polymerase activity were pooled and dialyzed against 
the column buffer additionally containing 10% glycerol 
overnight at room temperature to form NaCl-free FPLC 

25 purified Pyro polymerase. 

The resultant purified and dialyzed Pyro 
polymerase was then subjected to a final purification 
on a 1.5 X 20 ci Matrix gel Blue A column (Amicon, 
Danvers, MA). The Matrix gel Blue A column was first 

30 equilibrated with the before-described column buffer 
containing 10% glycerol, 0.1% Tween 20 
(polyoxyethylenesorbitan monolaurate) and 0.1% non- 
idet P40 (octylohenol-ethyl ene oxide condensate 
containing an average of 9 moles ethylene oxide per 

35 mole of phenol) . The purified and dialyzed Pyro 



polymerase was then applied to the column using FPLC 
pimps. The column containing the Pyro polymerase 
sample was then washed with two column volumes of the 
glycerol-containing column buffer to remove any 
proteins that did not bind to the column. The 
immobilized Pyro polymerase was eluted from the column 
with a one liter linear gradient of KC1 ranging in 
concentration from 0.0 M to 0.7 M KC1. 

Eluted fractions from the Af f i-gel column were 
collected and assayed for the presence of purified 
Pyro polymerase activity as described in Examples 2B 
and C above. The fractions eluted with 200 to 300 mM 
KC1 contained the peak Pyro polymerase activity, with 
the optimal activity recovered at about 280 mM KC1. 
The peak fractions were pooled and concentrated 
through Centricon-30 columns which have a molecular 
weight cut-off at 30,000 kD (Amicon, Beverly, MA) to 
form a concentrated solution of purified Pyro 
polymerase. The purified Pyro polymerase was 
thereafter dialyzed against column buffer containing 
50% glycerol to form KCl-free purified Pyro 
polymerase. The resultant salt-free Pyro polymerase 
was determined to be about 95% homogeneous. 

3 . Molecular Weight Determination 

The molecular weight of the purified Pyro 
polymerase prepared in Example 2D was determined by 
SDS-PAGE under non-denaturing conditions according to 
the method of Laemmli et al., J. Mol. Biol. . (1973) 
80:575-599. Samples of Pyro polymerase, Taq 
polymerase, phosphorylase B, and bovine serum albumin, 
were applied to a 6-18% gradient, 1 ram thick, SDS- 
polyacrylamide gel (No vex, Encinitas, CA) and 
electrophoresed in a running buffering containing 1% 
SDS, 2.4 mM Tris, and 18 mM Glycine. The results of 
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that analysis, shown in Figure 1, indicate that Pyro 
polymerase migrates faster than phosphorylase B 
(Sigma, St. Louis MO; molecular weight 97 , 200 daltons) 
and Tag polymerase (Perkin-Elmer Cetus, Norwalk, CT; * 
5 94,000 daltons), but slower than BSA (Sigma, 66,000 

daltons) . Because of its proximity to Taq, Pyro , 
polymerase was assigned a relative molecular weight of 
90,000-93,000 daltons. 

10 4. Fidelity Assays 

Various assays were performed to complete the 
characterization of DNA Polymerase I purified from L. 
furiosus . In the assays described below, the Pyro 
polymerase was compared to the commercially available 

15 and well characterized Thermus a ouaticus (Taq) DNA 
polymerase (U-S. Patent No. 4,889,818). 

To determine the error rate of Pyro polymerase, 
fidelity assays by PCR were performed with the Pyro 
polymerase in an assay procedure generally described 

20 by Kohler et al, Proc. Natl. Acad. Sci. USA . 88:7958- 
7962 (1991), in which in vivo mutagenesis was 
monitored during PCR amplification of transgenic mouse 
genomic DNA containing 33 copies per cell of the 
lacIOZa transgene. The entire lac I gene plus the 

25 alpha-complementing fragment of the beta-galactosidase 
gene was amplified (30 to 40 rounds) by PCR to form 
amplified DNA. The amplified 1.9 kb DNA was then 
cloned into the EcoR I site of lambda gtlO and plated 
on host strain DHSalpha (lacZAMIS) which contains the 

30 alpha fragment of beta-galactosidase. 

The complementation of the two proteins resulted 
in enzymically-active beta-galactosidase, which was 
detected as blue plagues when X-gal (5-bromo-4-chloro- 
3-indolyl-beta-D-galactosidase) was present as a 
35 substrate. In contrast, a functional non-mutant lac I 



repressed the expression of beta-galactosidase which 
resulted in white plagues. The mutation frequency in 
lac I was defined as the proportion of mutant blue 
plagues to the total number of plaques scored. The 
error rate was then calculated by the formula E = 
2(mf/d) where mf was the observed mutation frequency 
in the PCR product (lac I) and d was the number of 
doublings according to Saiki et al.. Science . 239:487- 
491 (1988) . 

Results from these studies indicate that Pyro 
polymerase has an error rate (mutations per nucleotide 
per PGR cycle) as low as 1 X 10" 6 compared to 1 X 10" 5 
for Taq DNA polymerase as described by Eckert et al., 
Nucl. Acids. Res .. 18:3739-3744 (1990). Thus Pyro 
polymerase exhibits about a ten-fold greater 
replication fidelity than Tag DNA polymerase. 

5. Exonuclease 3' to 5' Activit y Assays 

Purified Pyro polymerase, prepared as in Example 
2D, was assayed to measure its 3' to 5' exonuclease 
activity. The exonuclease assay was conducted 
according to Chase et al, J.Biol.Chem. . 249:4545-4552 
(1974), except that the reaction was at 72 °C instead 
of 37 °C and the substrate was Taq I -digested lambda 
DAN filled in with tritium labelled dCTP and dGTP. 
Briefly, the following reagents were admixed to form a 
reaction cocktail for the measurement of 3 1 to 5 1 
exonuclease activity; final concentrations (fc) of the 
reagents in the cocktail are in parentheses: 

(1) 40 Ml 1 M Tris-HCl, pH 7.5 (fc=40 mM) ; 

(2) 10 Ml 1 M MgCl 2 (fc=10 mM); 

(3) 13.3 Ml 0.75 M DTT (fc=10 mM) ; 

w (4) 100 Ml labelled Taq 1 cut lambda DNA 
filled in with Sequenase™ and labelled with *H dCTP 
and ^dGTP obtained from Amersham, Arlington Heights, 
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(5) 636.7 /il distilled water for a total 
volume of 1 ml. 

For the 3 r to 5 • exonuclease activity assay, 20 
5 to 25 Ml of the prepared reaction cocktail was admixed 
with either Pyro or Taq DNA polymerase. The admixture 
was maintained for 10 to 60 minutes at 72 °C to form 
hydrolyzed lambda DNA, specifically ^-S 1 phosphate 
mononucleotides . The reaction in each admixture was 

10 terminated by admixing 5 Ml of 15 mg/ml bovine serum 
albumin (BSA) and 13 Ml of 50% trichloroacetic acid 
(TCA) and maintaining the admixture on ice for 15 
minutes. The terminated reaction admixture was then 
centrifuged at 12,000 X g for 15 minutes to pellet the 

15 unhydrolyzed intact lambda DNA. 

The resultant supernatant containing the 3 H- 
5 • exonuclease-derived phosphate mononucleotides was 
removed from the pellet. Forty Ml of each supernatant 
was admixed with 80 ul distilled water and 1 ml 

20 scintillation fluid. The amount of 3 H radioactivity 
detected by scintillation counting was a relative 
measure of the exonuclease activity of the Pyro and 
Taq DNA polymerase preparations. 

The results of this study show that Pyro 

25 polymerase exhibits detectable 3' to 5' exonuclease 
activity, whereas Tag polymerase does not. 

To determine the amount of non-specific nuclease 
activity present in the Pyro polymerase preparation, 
Pyro polymerase prepared in Example 2D was admixed 

30 with 20 to 25 jil of a reaction cocktail ♦ The 

following reagents were admixed to form the reaction 
cocktail for the measurement of non-specific nuclease 
activity; final concentrations (fc) of the reagents in 
the cocktail are in parentheses: 

35 (1) 40 Ml 1 M Tris-Hcl, pH 7.5 (fc=40 mM) ; 



(2) 10 /il 1 M MgCl 2 (fc=10 mM); 

(3) , 13.3 til 0.75 M DTT (fc=10 mM) ; 

(4) 100 Ml 3 H-labelled E. coli chromosomal 
DNA (sheared ten times through a 21 gauge needle) ; and 

(5) 835.7 /xl distilled water for a final 
volume of 1 ml. 

The admixture containing the reaction cocktail 
and the homogenized gel were maintained for 10 to 60 
minutes at 75°C which was the optimal temperature for 
the enzyme to form hydrolyzed nucleic acid product. 
The reaction in the admixture was terminated and 
supernatant was assayed as described in Example 2C 
above. The results of this assay show that Pyro 
polymerase did not exhibit detectable non-specific 
nuclease activity. The 3' to 5' exonuclease activity 
by Pyro polymerase is, therefore, specific and not due 
to non-specific nuclease activity. 

Thus, pfu DNA polymerase I is a thermostable DNA 
polymerase that, unlike Tag DNA polymerase, possesses 
a 3' to 5 • exonucleases activity which enables pfu 
polymerases to proofread errors, and threrby exhibits 
a ten-fold greater fidelity during DNA synthesis 
reactions • 

6. PGR with Pyro Polymerase 

The specificity of Pyro polymerase was evaluated 
in PGR amplification of template DNA compared to that 
achieved with Taq DNA polymerase. To prepare the 
template DNA for the reactions, genomic DNA was first 
purified by phenol extraction and alcohol 
precipitation according to the procedures of Ausubel 
et al., Current Protocols in Molecular Biology . John 
Wiley and Sons (1987), from blood obtained by tail 
bleed from a transgenic mouse having about 1 to 2 
copies of a lambda transgene vector, which is referred 
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to as lambda transgene mouse genomic DNA. 

The lz-lambda polynucleotide primers with the 
used in the PCR amplifications were prepared by 
chemical synthesis using a model 381A polynucleotide 
5 synthesizer (Applied Biosystems Inc., Foster City, CA) 
according to the manufacturers instructions. 

A hybridization reaction admixture was formed by 
combining the following reagents in a sterile 0.5 ml 
microfuge tube: 
10 (1) 80 Ml of sterile, autoclaved water; 

(2) 10 Ml of 10X reaction buffer containing 
500 mM KC1, 100 mM Tris-HCl, pH8.3, 15 mM MgCl 2 , and 
0.1% sterile gelatin; 

(3) 8 Ml °f a solution containing 2.5 mM 
15 each of the deoxynucleotidetriphosphates (dNTP's) 

dGTP f dCTP, dTTP, and dATP; 

(4) l Ml of a solution containing 250 ng 
each of the two polynucleotide primers described 
above; and 

20 (5) 1 Ml of lambda transgene mouse genomic 

DNA. 

The hybridization reaction admixture was heated^ 
to 94 °C and maintained at 94 °C for 1 minute to 
denature the duplex genomic DNA present and form 

25 single-stranded templates. The admixture was then 

cooled to 54 °C and maintained for 2 minutes to allow 
hybridization to occur and form duplex DNA. The 
hybridized admixture was thereafter centrifuged in a 
microfuge at 12,000 X g for 10 seconds to collect 

30 condensation off the microfuge tube walls. To 
separate microfuge tubes, 0.5 ul of a solution 
containing 2.5 units of either Taq DNA polymerase 
(obtained from Perkin-Elmer Cetus, Norwalk, CT or from 
Stratagene, La Jolla, CA) or Pyro polymerase prepared 

35 in Example 2D was admixed to form a primer extension 



reaction admixture. Additional : separate primer 
extension reaction admixtures we e made by diluting 
the amount of Pyro polymerase from 1:1.5, 1:2, 1:2.5, 
1:3, 1:4 and 1:5 to determine the optimal 
concentration of DNA synthesis. (The 1:3 dilution 
represented about 1.5 unit of Pyro polymerase per 
microliter.) 

Each microfuge tube containing the above-prepared 
primer extension reaction admixture was overlayed with 
50 Ml of mineral oil and then place into a DNA Thermal 
Cycler (Perkin-Elmer Cetus) and subjected to the 
following temperature and time conditions: 1) 94 °C 
for 1 minute to denature duplex DNA; 2) cooled to 54 °C 
for 2 minutes to anneal the primers; and 3) heated to 
74 °C for 1.0 minute to activate the polymerase and 
maintained at 74 °C for 0.5 minutes to extend the 
annealed primers. The tubes were subjected to 30 
cycles of the above sequence according to the 
manufacturers instructions. The cycled tubes were 
then maintained at 72 °C for 10 minutes followed by 4°C 
for 12 hours. 

The contents of each primer extension reaction 
admixture were analyzed on a 6% polyacrylamide gel in 
IX TBE by loading 35 Ml of the admixture sample and 5 
Ml of 10X sample buffer onto an 8 centimeter gel, 
electrophoresing the gel at 100V for about one hour 
followed by. staining the electrophoresed gel with 
ethidium bromide to visualize the electrophoresed 
nucleic acids. 

The results of the electrophoresed PCR amplif ied 
lambda transgene mouse genomic DNA using either Pyro 
or Taq DNA polymerase indicted that the Taq from Cetus 
and Stratagene gave nr rly identical results. The 
results using the Pyro DNA polymerase to amplify 
genomic DNA indicate that it produces less background. 
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The 1:2.5 dilution of Pyro polymerase resulted in 
optimal PCR amplification. 

7. Large Scale Preparation of Pvrococcus furiosus 
5 DNA Polymerase I 

The following steps 1-11 are performed at room 
temperature: 

1. Thaw 1500 grams of frozen cell paste at room 
temperature. Add 4 volumes (6000 mis) of lysis buffer 

10 A. 

2. Resuspend the cells and incubate at room 
temperature for 30 minutes. Cycle the cells through a 
French press two times. 

3. Sonicate the product of step 2 for 10 

15 minutes at room temperature. Centrifuge the resulting 
lysate at 9K RPMs in a GS3 (Sorvall) rotor at room 
temperature for 60 minutes. 

4. Collect the supernatant (Fraction I) and 
load it directly onto a Q-sepharose (8 X 30 cm) 

20 column. Wash the loaded column with 4 volumes (6 
litres) of buffer B. 

5. Collect the pass-through and washes. Adjust 
the collected material to pH 6.0 with HC1, and 
centrifuge it at 9K in a GS3 rotor at room temperature 

25 for 60 minutes to remove the protease-containing 
precipitate. 

6. Collect the supernatant and adjust its pH to 
7.8 with NaOH (Fraction II) . Load it at 50 ml /minute 
directly onto a 500 ml radial flow P-ll 

30 (phosphocellulose) (Sepragen, San Leandra, CA) column 
equilibrated with buffer B. Wash the column with 1-2 
litres of buffer B until the OD^ of the wash is back 
to baseline. 

7. Elute DNA polymerase activity with a 0-0. 7M 
35 NaCl gradient in buffer B (2 X 2500 ml) . The peak of 
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polymerase activity elutes at approximately 0.2-0.4M 
NaCl. 

8. Assay and pool those fractions containing 
greater than 90% of the total polymerase activity. 

5 All subsequent steps are performed at 4°C. 

9. Dialyze the pooled material of step 8 
overnight against 200 volumes buffer B at 4°C. 

10. Remove the dialyzed Pyro polymerase and 
clarify it with a 30 minute centrifugation if 

10 necessary (Fraction III) . Load dialysate onto a 

heparin-agarose column (5 X 20 cm) equilibrated with 
buffer B. Wash with 1 litre buffer B and elute with a 
0-0. 7M NaCl gradient in buffer B (2 X 1.5 litres). 

11. Assay for polymerase activity, and pool the 
15 fractions containing greater than 90% of total 

activity. Dialyze the recovered material against 200 
volumes of buffer C. 

12. Remove the Pyro polymerase from dialysis 
(Fraction IV) and load it onto an Affigel Blue column 

20 (2.5 X 20 cm) equilibrated with buffer C. 

13. Wash the Affigel-Blue column with buffer C 
until the OD 280 approaches baseline and elute the 
Pyro polymerase with a 0-0. 7M KC1 gradient in buffer C 
( 2 x 1000 ml) . 

25 14. Assay for polymerase and 3* exonuclease 

activities. Analyze the active fractions by silver 
stained SDS-PAGE gels. Pool the pure Pyro polymerase 
(greater than 90% pure on a w/w basis) fractions based 
on visual analysis silver stained gel. 

30 15. Dialyze overnight against 200 volumes of 

buffer D at 4°C. Remove the dialysate (Fraction V) 
and load it onto an 1.5 x 30 cm Pll (phosphocellulose) 
column equilibrated with buffer D. Wash with buffer D 
until wash OD 280 is baseline. Elute the Pfu DNA Poll 

35 with a 2 X 250 ml linear gradient 0.0-0.7M kCl 
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prepared in buffer D. 

16 . Assay for polymerase and 3 1 exonuclease 
activities. Analyze the active fractions by silver 
stained SDS-PAGE gels. Pool the pure Pyro polymerase 

5 (greater than 95% pure on a w/w basis) fractions based 
on visual analysis silver stained gel. 

17. Pool and dialyze overnight at 4°C against 20 
volumes buff er E. Remove from dialysis (fraction VI) 
and store at -20°C. 

10 

BUFFERS 

Buffer A; (6 litres) 
50 mM Tris-Cl pH 8.2 
10 mM beta mercaptoethanol 
15 1 mM EDTA 

200 /jg/ml lysozyme 

Buffer Bi (20 litres) 
50 mM Tris-Cl pH 8.2 
20 10 mM beta mercaptoethanol 

1 mM EDTA 

Buffer Ct (6 litres) 
50 mM Tris-Cl pH 8.2 
25 10 % glycerol 

1 mM EDTA 
1 mM DTT 
0.1% tween 20 
0.1% nonidet P40 

30 

Buffer D:(2 litres) 
50 mM Tris-Cl pH 8.2 
10% glycerol 
1.0 mM EDTA 
35 io mM beta mercaptoethanol 
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0.1% tween 20 
0,1% nonidet P40 

fe?r E 

5 50 Tris Hcl pH 8.2 

0.1 mM EDTA 
1 mM DTT 
0.1% Tween 20 
0.1% NP 40 
10 50% glycerol 

RESINS 

1.5 LITRES Q-SEPHAROSE (PHARMACIA) 
700 MLS P-ll (WHATMANN) 
15 200 MLS HEPARIN-AGAROSE (BIORAO) 

200 ML AFFI6EL-BLUE (BIORAD) 

8. production of Pf u I 

Cell Growth : Pyrococcus furiosus (DSM 3638) was 

20 grown at 85-88 °C as closed static cultures in a medium 
containing maltose (5g/liter) , NH 4 C1 (1.25 g/liter) 
elemental sulfur (S°, 5 g/liter), Na 2 S (0.5 g/liter^,_ 
synthetic sea water (17), a vitamin mixture (18), 
FeCl 3 (25 mM) , NaW0 4 (lOinM) and yeast extract (1.0 

25 g/liter). Growth was monitored by direct cell count 

and by the increase in turbidity at 600 nm. For large 
scale cultures, sulfide was replaced with titanium 
(III) citrate, and S° was omitted which necessitated 
sparging with Ar. Two 20 liter cultures served as an 

30 innoculum for growth in a 400 liter fermenter where 

the cultures were maintained at 88 °C, bubbled with Ar 
(7.5 liters/min) and stirred (50 rpm) . Cells were 
harvested after approximately 20 hours (OD^-O.S) with 
a Sharpies centrifuge at 100 liters/hour. 

35 
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NOTE: The following procedures are performed at 25°C, 
unless otherwise stated* 

Cell Lysis : The night before lysis, 500 grams of 
frozen cell paste is transferred to a 2-8 C 
5 refrigerator. The next morning, the cells are 

transferred to a 4 liter stainless steel beaker. The 
cells are resuspended using 4 volumes (2000 ml) of 
lysis Buffer 8A. The cell suspension is then 
incubated at room temperature for 1.5 hours. The 
10 cells are lysed in the French Press using 2 passes at 
8K PSI. The lysate is then sonicated at room 
temperature for 10 minutes. 

Following sonication, the lysate is transferred 
to 400 ml bottles and spun for 60 minutes at 9K rpm at 
15 room temperature in the Sorvall RC-2B using a Sorvall 
GS3 rotor. The supernatant (Fraction I) is collected 
and the volume measured. 

Q-Sepharose Column: Fraction I is loaded 
directly onto a 8 x 30 cm radial flow Q-sepharose 
20 column (-1500 ml) pre-equilibrated in Buffer IB at a 
flow rate of approximately 50 ml/minute. The column 
is then washed with 4 column volumes (6000 ml) of 
Buffer SB- 
NOTE: It is important to carefully collect the 
25 pass-thru and wash fractions as they contain the Pfu 
polymerase enzyme 

The Q-Sepharose pass-thru and column washes are 
next combined as one pool (Fraction II) . 

BPA-1000 Precipitation : Bioprocessing aid 
30 BPA-1000 (TosoHaas) pilots are conducted on Fraction 
II to determine the appropriate volume required to 
precipitate cell debri and nucleic acids but not Pfu 
polymerase. 1.0 ml aliquots of Fraction II are placed 
in 8 tubes. The BPA-1000 is mixed thoroughly and 0, 
35 2, 4, 6, 8, 10, 12, 14, 16, 18 and 20 ml of BPA-1000 
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are added to the tubes respectively, mixed gently, 
incubated for 5 minutes, and spun in a microfuge for 5 
minutes. Carefully decant and save the supernatants . 
Samples of each tube are then evaluated using the 
5 gapped duplex polymerase assay. Evaluate the clarity 
of each supernatant and the firmness of each pellet. 
Based on the control tube (without BPA addition) , 
determine which concentration of BPA increases the 
clarity of the supernatant without sacrificing 

10 polymerase yield. The appropriate amount of BPA-1000 
is then added to Fraction II and stirred for 30 
minutes. The suspension is then transferred to 400 ml 
centrifuge bottles and spun at 9K rpm at room 
temperature (25°C) for 60 minutes in a Sorvall RC-2B 

15 using a Sorvall GS3 rotor. The supernatant (Fraction 
III) is collected and the volume measured. 

Fraction III is then adjusted to pH 7.5 with 1 N 
HC1 if necessary. 

P-ll Cellulose c aium^* Fraction III is loaded 

20 directly onto a 10 x 14 cm P-ll column ("1000 ml) 

pre-equilibrated in Buffer 8C at a flow rate of 
approximately 5 ml /minute. The column is washed with 
Buffer 8C until the OD^ approaches baseline (4 column 
volumes) • The column is next eluted with a 2 x 4000 

25 ml gradient (0 to 700 mM KC1) prepared in Buffer 8C. 
25 ml fractions are collected, every fifth tube, 
put-on and pass-thru are assayed for polymerase 
activity. The fractions containing the peak of Pfu 
Pol I activity are pooled and concentrated to 

30 approximately 200 ml using an Amicon model CH 2 

concentrator with a SIY30 membrane cartridge. Any 
remaining Pfu Pol I is washed from the concentrator by 
flushing the lines with an additional 300 ml Buffer 
8C. The concentrate and wash are combined (Fraction 

35 IV). 
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NOTE: The following procedures are performed at 

4°C. 

Fraction IV is next dialyzed overnight against 18 
liters of Buffer D. 
5 Hi-Load P rn-inirm: The following morning the 

dialysate is transferred to 400 ml centrifuge bottles 
and spun 9K rpm at 4°C for 60 minutes in a Sorvall 
RC-2B using the GS3 rotor. The supernatant (Fraction 
V) is recovered and the volume recorded. Fraction V 

10 is divided into two equal portions. The first portion 
is loaded directly onto a FPLC Hi-Load S column ("58 
ml) pre-equilibrated in Buffer D at a flow rate of 5.0 
ml/minute. The column is washed with Buffer 8D until 
the OD 280 approaches baseline. The column is next 

15 eluted with a 2 x 250 ml gradient (0 to 250 mM KC1) 

prepared in Buffer D. 5.0 ml fractions are collected, 
every third tube, put-on and pass-thru are assayed for 
polymerase activity. The above procedure is repeated 
for the second portion of Fraction V. The fractions 

20 containing the peak Pfu Pol I activity are pooled and 
dialyzed overnight against 2x4 liters of Buffer 8E 
at 4 C. The following morning, the dialysate is 
removed from dialysis and the volume recorded 
(Fraction VI) . 

25 Heparin Sepharose CL-6B Column : Fraction VI is 

loaded onto a 1.5 x 90 cm heparin sepharose CL-6B 
column (~ 159 ml) pre-equilibrated in Buffer 8E at a 
flow rate of 0.5 ml/minute. The column is washed with 
1000 ml Buffer 8E. The column is next eluted with a 2 

30 x 750 ml gradient (0 to 300 mM KC1) prepared in Buffer 
8E. 10 ml fractions are collected, every third tube, 
put-on and pass-thru are assayed for polymerase 
activity. A protein gel is also recommended for peak 
evaluation. The fractions containing the peak Pfu DNA 

35 polymerase activity are pooled and dialyzed overnight 
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against 2x4 liters of Buffer 8E. The following 
morning, the dialysate is removed from dialysis and 
the volume recorded (Fraction VII) , 

Affi-Gel Blue Column ; Fraction VII is loaded 
5 onto a 2.5 x 4.0 cm affi-gel blue column (~ 20 ml) 
pre-equilibrated in Buffer 8E at a flow rate of 0.5 
ml/minute. The column is washed with Buffer 8E until 
the OD m approaches baseline. The column is next 
eluted with a 2 x 500 ml linear gradient (0 to 250 mH 

10 KC1) prepared in Buffer 8E. 10 ml fractions are 

collected, every third tube, put-on and pass-thru are 
assayed for polymerase activity. A SDS-PAGE gel is 
also recommended for careful peak evaulation. The 
fractions containing the peak Pfu Pol I activity are 

15 pooled and dialyzed overnight against 1 liter final 
dialysis Buffer 8F. The following morning, the 
purified enzyme is removed from dialysis and 
transferred to -20°C storage and represents a purified 
pfu polymerase I enzyme of this invention. 

20 Buffer 8A ; Lysis Buffer 

50 mM Tris-Cl, pH 8.2 
1 mM EDTA 

10 mM b-mercaptoethanol 
200 mg/mllysozyme 

25 

Buffer 8B : Q-Sepharose Buffer 
50 mM Tris-Cl, pH 8.2 

1 mM EDTA 
10 mM b-mercaptoethanol 

30 

Buffer 8C z Phosphocellulose Buffer 
50 mM Tris-Cl, pH 7.5 

1 mM EDTA 
10 mM b-mer captoethano 1 



35 
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Buffer 8D : High Load S Buffer 

50 mM Tris-Cl, pH 7.5 

1 mM EDTA 

1 mM DTT 

5 10 % (v/v) glycerol 

0.1 % (v/v) NP-40 

0.1 % (v/v) Tween 20 

Buffer 8E : Af f igel Blue & Heparin Buffer 

10 50 mM Tris-Cl, pH 8.2 

1 mM EDTA 

1 mM DTT 

10 % (v/v) glycerol 

0.1 % ■> (v/v) NP-40 

15 0.1 % (v/v) Tween 20 

Buffer 8F : Final Dialysis Buffer 



50 


mM 


Tris- 


CI, pH 8.2 


0.1 


mM 


EOTA 




1 


mM 


DTT 




0.1 


% 


(V/V) 


NP-40 


0.1 


% 


(v/v) 


Tween 20 


50 


% 


(v/v) 


glycerol 



25 9. Cloning the Gene that Encodes Pvrococcus f uriosus 
(Pfu) DNA Polymerase I 

Pyrococcus furiosus (DSM 3638) was grown as 
described by Bryant et al, J. Biol. Chem. . 
264:5070-5079 (1989) , with an additional supplement of 

30 lOmM Na 2 W0 4 as described in the Examples. Following 

harvesting by centrifugation, genomic DNA was isolated 
from the biomass using Stratagene's genomic DNA 
isolation kit according to manuf acturer ' instructions. 
The DNA was then randomly sheared by several passages 

35 through an eighteen gauge needle and the fragments 



were separated by sucrose gradient centr if ligation. 
The size of the fragments present within the fractions 
of the sucrose gradient were next estimated by agarose 
gel electrophoresis. The fractions containing four to 
nine kilobase fragments were combined and ligated to 
EcoRl linkers and the resulting inserts were ligated 
into EcoRl cut Lambda Zap II vector (Stratagene, La 
Jolla, CA) to create a genomic Pvrococcus furiosus 
library. This library was plated with XLl-Blue E. 
coli (Stratagene) on LB plates. Plague lifts were 
performed on Duralose nylon filters (Stratagene) to 
isolate individual bacteriophage colonies containing a 
cloned insert. 

N-terminal amino acid sequence determination of 
pfu I purified as described in Example 2 was performed 
by the Wistar Institute. Briefly, partially purified 
protein was subjected to SDS-PAGE followed by 
electrotransfer to a nylon membrane. The band 
corresponding to Pfu polymerase was isolated and 
subjected to protein microsequencing. From this 
microsequencing analysis, the unambiguous sequence of 
the forty-eight N-terminal amino acids of the pfu 
polymerase I protein was determined. The 48 residues 
corresponds to residues 1 to 48 of SEQ ID NO l. 
Internal amino acid sequence analysis was also 
performed by the Wistar Institute on tryptic digested 
fragment of the Pfu polymerase protein. 

Based on the N-terminal 48 amino acid residue 
sequence information, a series of degenerate PGR 
oligonucleotide primers were designed in pairs which 
would produce a 94 basepair (bp) PGR product. The 94 
bp product corresponds to amino acid residues within 
the 48 residues. These oligonucleotides (23 and 18 
bases corresponding to the two ends of the 94-mer) 
were designed such that the 3 1 terminal 8 nucleotides 
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of every sequence possible based on the known amino 
acid residue sequence within the 48 residues was 
present as a separate oligonucleotide. The 23-mer 
corresponds to possible nucleotides at positions 224 
5 to 246, and the 18-mer corresponds to possible 

nucleotides at positions 300 to 317. Whenever there 
was a wobble position in the rest of the 
oligonucleotide primer, a T was used. The rationale 
for these substitutions was based on the fact that the 

10 GC content of pfu is considerably low and that T 

mismatches are most tolerated by Taq polymerase. In 
this way, four oligos were needed for each of the two 
primer positions. Each of these oligos contained some 
mismatched bases but no degenerate positions. 

15 The PCR was performed on pfu genomic DNA with all 

16 possible primer pairs. Following agarose gel 
electrophoresis, it was determined that 12 of the 16 
amplification reactions produced the expected 94bp PCR 
product. One of the reactions containing the 94bp 

20 product was subjected to direct cycle sequencing 
(Stratagene) using both PCR primers used in the 
amplification reaction as sequencing primers. A 53 
base sequence deduced from the DNA sequencing gel was 
found to corresponded 100% to the known N-terminal 

25 amino acid sequence between a pair of primers, thus 
confirming that the sequence was from the gene 
encoding Pfu DNA polymerase. A 53 base 
oligonucleotide probe containing this sequence was 
next synthesized, and having the nucleotide sequence 

30 shown in SEQ ID NO 2 from nucleotides 247 to 299. 

This oligonucleotide was then end labelled with 32 P 
and used to screen the Pfu genomic library. 
Approximately four Pyrococcus genomes were represented 
in the screened library. The probe was hybridized to 

35 plaque lifts at 42°C for two hours in QuikHyb 
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(Stratagene) and washed twice at room temperature for 
15 minutes each in 2X SSC 0.1% SDS. Thirteen putative 
clones were identified and the plaques cored and 
resuspended in 300ml SM. Ten ml of each lysate was 
5 transferred into IX Taq reaction buffer, heated to 
100°C for ten minutes, and 30 cycles of PCR was 
performed using the PCR primer pair which originally 
produced the expected 94 bp fragment (PCR was 
performed according to the procedure described in the 

10 GeneAmp kit [Cetus Perkin-Elmer] ) . Following 

electrophoretic analysis of the PCR products, three 
clones produced the expected 94 bp PCR product. These 
three lambda clones were excised into to a pBluescript 
plasmid (Stratagene) according to the manufacturer's 

15 protocol. The resulting transf ormants were then 

screened by PCR. PCR was performed on single colony 
transf ormants (100°C in IX Ta§ reaction buffer for 10 
minutes, then centrifuged briefly followed by 30 
cycles of PCR using the original primer pair and the 

20 procedure described in the GeneAmp kit. Three 

positive plasmid clones were identified from each 
excision and large scale plasmid preparations were 
performed on one colony from each clone. 

The three bonafide polymerase clones were next 

25 mapped. One had a relatively small 1500bp insert. 

The other two clone inserts were about 4500 basepairs; 
one containing about lOOObp of the Pfu polymerase 
sequence while the other clone contained the entire 
polymerase gene. The clone having the entire pfu 

30 polymerase I gene was named pF72. 

The insert of clone pF72 was sequenced on both 
strands using Sequenase™ (USB) and custom 
oligonucleotide primers using a primer walking 
strategy. The sequence of the polymerase gene is 

35 shown in SEQ ID NO 2 from nucleotide bases 224 to 



2548 , and consists of a 2265 bp DNA seqment encoding 
775 amino acids corresponding to a 90,113 Dalton 
protein* 

The Pfu gene can be cloned into an expression 
system for production of the recombinant protein. The 
complete coding region of the pfu polymerase I gene is 
PCR-amplif ied using Pfu polymerase to limit mutations 
and the PCR product product is ligated in reading 
frame into the vector pRSET (Invitrogen) . The ligated 
vector is transformed into an F' containing E. coli 
host strain and protein production is induced with a 
recombinant M13 phage expressing cloned T7 RNA 
polymerase under control of the lacUVS promoter. The 
expression vector is designed to introduce an affinity 
tail which can be used to facilitate purification of 
the recombinant protein. Following cell lysis and 
clarification of the crude supernatant, the 
recombinant Pfu polymerase protein is isolated in a 
single step by metal affinity chromatography. 

In another strategy, the coding region of the Pfu 
DNA polymerase gene is amplified with Pfu polymerase 
using primers which introduce a unique restriction 
site at the ends of the PCR product to facilitate 
cloning into a pBluescript vector. When cloned into 
the T3 orientation, the protein is expressed under 
control of the lacZ promoter. Following expression, 
recombinant Pfu polymerase is purified with a 
modification of the procedure used to purify the 
native enzyme. Briefly, a heat precipitation step is 
employed following cell lysis and clarification. The 
heat precipitation step denatures and precipitates the 
majority of E. coli host proteins but not the 
thermostable Pfu polymerase and the remaining soluble 
fraction is recovered and used as described before as 
a source for purification of the pfu polymerase I 
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protein. 

The foregoing is intended as illustrative of the 
present invention but not limiting. Numerous 
5 variations and modifications can be effected without 
departing from the true spirit and scope of the 
invention. 
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PARTIAL SEQUENCE LISTING 
(The remaining sequences are in the specification) 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Mathur, Eric A 

(ii) TITLE OF INVENTION: Purified Thermostable Pyrococcus 
Furiosus DNA Polymerase I 

(iii) NUMBER OF SEQUENCES: 2 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Bingham and Fitting 

(B) STREET: 11230 Sorrento Valley Road, Suite 200 
. (C) CITY: San Diego 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 92121 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 02-DEC-1991 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/620,568 

(B) FILING DATE: 03-DEC-1990 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/657,073 

(B) FILING DATE: 19-FEB-1991 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/776,552 

(B) FILING DATE: 14-0CT-1991 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Fitting, Thomas 

(B) REGISTRATION NUMBER: 34,163 

(C) REFERENCE/DOCKET NUMBER: STG0100P 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 619-587-3533 

(B) TELEFAX: 619-587-0574 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 775 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

Met He Leu Asp Val Asp Tyr He Thr Glu Glu Gly Lys Pro Val He 
1 5 ■ 15 

Are Leu Phe Lys Lys Glu Asn Gly Lys Phe Lys He Glu His Asp Arg 
20 25 30 

Thr Phe Arg Pro Tyr He Tyr Ala Leu Leu Arg Asp Asp Ser Lys He 
35 40 45 

Glu Glu Val Lys Lys He Thr Gly Glu Arg His Gly Lys He Val Arg 
50 55 60 

He Val Asp Val Glu Lys Val Glu Lys Lys Phe Leu Gly Lys Pro He 
65 70 75 80 

Thr Val Trp Lys Leu Tyr Leu Glu His Pro Gin Asp Val Pro Thr He 
85 90 95 

Arg Glu Lys Val Arg Glu His Pro Ala Val Val Asp He Phe Glu Tyr 
100 105 HO 

Asp He Pro Phe Ala Lys Arg Tyr Leu He Asp Lys Gly Leu He Pro 
115 120 125 

Met Glu Gly Glu Glu Glu Leu Lys He Leu Ala Phe Asp He Glu Thr 
130 135 140 

Leu Tyr His Glu Gly Glu Glu Phe Gly Lys Gly Pro He He Met He 
145 150 ^ 155 160 

Ser Tyr Ala Asp Glu Asn Glu Ala Lys Val He Thr Trp Lys Asn He 
165 170 175 
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Asp Leu Pro Tyr Val Glu Val Val Ser Ser Glu Arg Glu Met He Lys 
180 185 • 

Arg Phe Leu Arg He lie Arg Glu Lys Asp Pro Asp lie He Val Thr 

195 200 205 

Tyr Asn Gly Asp Ser Phe Asp Phe Pro Tyr Leu Ala Lys Arg Ala Glu 
210 215 220 

Lys Leu Gly He Lys Leu Thr He Gly Arg Asp Gly Ser Glu Pro Lys 
225 230 235 240 

Met Gin Arg He Gly Asp Met Thr Ala Val Glu Val Lys Gly Arg He 
245 250 255 

His Phe Asp Leu Tyr His Val lie Thr Arg Thr He Asn Leu Pro Thr 
260 265 

Tyr Thr Leu Glu Ala Val Tyr Glu Ala He Phe Gly Lys Pro Lys Glu 
275 280 285 

Lvs Val Tyr Ala Asp Glu He Ala Lys Ala Trp Glu Ser Gly Glu Asn 
290 295 300 

Leu Glu Arg Val Ala Lys Tyr Ser Met Glu Asp Ala Lys Ala Thr Tyr 
305 310 315 

Glu Leu Gly Lys Glu Phe Leu Pro Met Glu He Gin Leu Ser Arg Leu 
325 330 335 

Val Gly Gin Pro Leu Trp Asp Val Ser Arg Ser Ser Tbr Gly Asn Leu 
340 345 350 

Val Glu Trp Phe Leu Leu Arg Lys Ala Tyr Glu Arg Asn Glu Val Ala 
355 360 365 

Pro Asn Lys Pro Ser Glu Glu Glu Tyr Gin Arg Arg Leu Arg Glu Ser 
370 375 380 

Tyr Thr Gly Gly Phe Val Lys Glu Pro Glu Lys Gly Leu Trp Glu Asn 
385 390 395 400 

He Val Tyr Leu Asp Phe Arg Ala Leu Tyr Pro Ser He He lie Thr 
405 410 415 

His Asn Val Ser Pro . Asp Thr Leu Asn Leu Glu Gly Cys Lys Asn Tyr 
420 «5 *30 

Asp He Ala Pro Gin Val Gly His Lys Phe Cys Lys Asp He Pro Gly 
435 440 445 
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Phe He Pro Ser Leu Leu Gly His Leu Leu Glu Glu Arg Gin Lys He 
450 *55 460 

Lys Thr Lys Met Lys Glu Thr Gin Asp Pro He Glu Lys He Leu Leu 
465 470 475 480 

Asp Tyr Arg Gin Lys Ala He Lys Leu Leu Ala Asn Ser Phe Tyr Gly 
485 490 495 

Tyr Tyr Gly Tyr Ala Lys Ala Arg Trp Tyr Cys Lys Glu Cys Ala Glu 
500 505 510 

Ser Val Thr Ala Trp Gly Arg Lys Tyr He Glu Leu Val Trp Lys Glu 
515 520 525 

Leu Glu Glu Lys Phe Gly Phe Lys Val Leu Tyr He Asp Thr Asp Gly 
530 535 540 

Leu Tyr Ala Thr He Pro Gly Gly Glu Ser Glu Glu He Lys Lys Lys 
545 550 555 560 

Ala Leu Glu Phe Val Lys Tyr He Asn Ser Lys Leu Pro Gly Leu Leu 
565 570 575 

Glu Leu Glu Tyr Glu Gly Phe Tyr Lys Arg Gly Phe Phe Val Thr Lys 
580 585 590 

Lys Arg Tyr Ala Val He Asp Glu Glu Gly Lys Val He Thr Arg Gly 
595 600 605 

Leu Glu He Val Arg Arg Asp Trp Ser Glu He Ala Lys Glu Thr Gin 
610 615 620 

Ala Arg Val Leu Glu Thr He Leu Lys His Gly Asp Val Glu Glu Ala 
625 630 635 640 

Val Arg He Val Lys Glu Val He Gin Lys Leu Ala Asn Tyr Glu He 
645 650 655 

Pro Pro Glu Lys Leu Ala He Tyr Glu Gin He Thr Arg Pro Leu His 
660 665 670 

Glu Tyr Lys Ala He Gly Pro His Val Ala Val Ala Lys Lys Leu Ala 
675 680 685 . 

Ala Lys Gly Val Lys He Lys Pro Gly Met Val He Gly Tyr He Val 
690 695 700 

Leu Arg Gly Asp Gly Pro He Ser Asn Arg Ala He Leu Ala Glu Glu 
705 710 715 720 

Tyr Asp Pro Lys Lys His Lys Tyr Asp Ala Glu Tyr Tyr He Glu Asn 
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725 730 735 

Gin Val Leu Pro Ala Val Leu Arg He Leu Glu Gly Phe Gly Tyr Arg 
740 - 745 750 

Lys Glu Asp Leu Arg Tyr Gin Lys Thr Arg Gin Val Gly Leu Thr Ser 
J , 755 760 . 765 

Trp Leu Asn He Lys Lys Ser 
770 77 5 

(2) INFORMATION FOR SEQ ID NO;2: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3499 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(ix) FEATURE: 

CA) NAME/KEY: 5'UTR 
(B) LOCATION: 1. .223 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION; 224.. 2551 

(ix) FEATURE: 

(A) NAME/KEY: 3'UTR 

(B) LOCATION: 2552.. 3499 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

CCCTGGTCCT GGGTCCACAT ATATGTTCTT ACTCGCCTTT ATGAAGAATC CCCCAGTCGC 60 

TCTAACCTGG GTTATAGTGA CAAATCTTCC TCCACCACCG CCCAAGAAGG TTATTTCTAT 120 

CAACTCTACA CCTCCCCTAT TTTCTCTCTT ATGAGATTTT TAAGTATAGT TATAGAGAAG 180 

GTTTTATACT CCAAACTGAG TTAGTAGATA TCTGGGGAGC ATAATGATTT TAGATGTGGA 240 

TTACATAACT GAAGAAGGAA AACCTGTTAT TAGGCTATTC AAAAAAGAGA ACGGAAAATT 300 

TAAGATAGAG CATGATAGAA CTTTTAGACC ATACATTTAC GCTCTTCTCA GGGATGATTC 360 



WO 92/09689 



PCT/US91/09026 



67 



AAAGATTGAA 


GAAGTTAAGA AAATAACGGG 


GG AAAGG CAT 


GGAAAGATTG TGAGAATTG1 


y. on 


TGATGTAGAG 


AAGGTTGAGA AAAAGTTTCT 


CGGCAAGCCT 


a mm a mm j uhMm AAA o^R^p^H A 

ATTACCGTGT GGAAACTTTA 


A OA 

4oU 


TTTGGAACAT 


rCCCAAGATG 


TTCCCACTAT 


TAGAGAAAAA 


GTTAGAGAAC ATCCAGCAGT 


540 


TGTGGACATC 


TTCGAATACG 


ATATTCCATT 


9W%^^ A A A ^ A A 

TGCAAAGAGA 


TACCTCATCG ACAAAGGCUi 


ouu 


AATACCAATG 


GAGGGGGAAG 


AAGAGCTAAA 


GATTCTTGCC 


TTCGATATAG AAACCCTCTA 


660 


TCACGAAGGA 


GAAGAGTTTG 


GAAAAGGCCG 


AATTATAATG 


A flVM A AfWfl A PW^^ A A #W A A A A 

ATTAGTTATG CAGATGAAAA 


7ZO 


TGAAGCAAAG 


GTGATTACTT 


GGAAAAACAT 


AGATCTTCCA 


TACGTTGAGG TTGXATCAAG 


/oU 


CGAGAGAGAG 


ATGATAAAGA 


GATTTCTCAG 


GATTATCAGG 


GAGAAGGATC CTGACATTAT 


840 


AGTTACTTAT 


AATGGAGACT 


CATTCGACTT 


CCCATATTTA 


GCGAAAAGGG CAGAAAAACT 


900 


TGGGATTAAA 


TTAACCATTG 


GAAGAGATGG 


AAGCGAGCCC 


AAGATGCAGA GAATAGGCGA 


960 


TATGACGGCT 


GTAGAAGTCA AGGGAAGAAT 


ACATTTCGAC 


TTGTATCATG TAATAACAAG 


1020 


GACAATAAAT 


CTCCCAACAT ACACACTAGA 


GGCTGTATAT 


GAAGCAATTT TTGGAAAGCC 


1080 


AAAGGAGAAG 


GTATACGCCG ACGAGATAGC 


AAAAGCCTGG 


GAAAGTGGAG AGAACCTTGA 


1140 


GAGAGTTGCC 


AAATACTCGA TGGAAGATGC 


AAAGGCAACT 


TATGAACTCG GGAAAGAATT 


1200 


CCTTCCAATG 


GAAATTCAGC 


TTTCAAGATT 


AGTTGGACAA 


CCTTTATGGG ATGTTTCAAG 


1260 


GTCAAGGAGA 


GGGAACCTTG 


TAGAGTGGTT 


CTTACTTAGG 


AAAGCCTACG AAAGAAACGA 


1320 


AGTAGCTCCA 


AACAAGCGAA 


GTGAAGAGGA 


GTATCAAAGA 


AGGCTCAGGG AGAGCTACAC 


1380 


AGGTGGATTC 


GTTAAAGAGC 


GAGAAAAGGG 


GTTGTGGGAA 


AACATAGtAT ACCTAGATTT 


1440 


TAGAGCCCTA 


TATCCCTCGA TTATAATTAC 


CCACAATGTT 


TCTCCCGATA CTCTAAATCT 


1500 


TGAGGGATGC 


AAGAACTATG 


ATATCGCTCC 


TCAAGTAGGC 


A f+ A A A'P^ ^ MM M ^ » * f>f> A ^ AT* 

CACAAGTTCT GGAAGGAGAT 


I->bU 


CCCTGGTTTT 


ATACCAAGTC 


TCTTGGGACA 


TTTGTTAGAG 


^ AAA ^ A ^ AAA A ^ A »P^* A A ^ A ^ 

GAAAGACAAA AGATTAAGAC 




AAAAATGAAG 


GAAACTCAAG 


ATCCTATAGA 


AAAAATACTC 


CTTGACTATA GACAAAAAGC 


1680 


GATAAAACTC 


TTAGCAAATT 


CTTTCTACGG 


ATATTATGGC 


TATGCAAAAG CAAGATGGTA 


1740 


CTGTAAGGAG 


TGTGCTGAGA 


GCGTTACTGC 


CTGGGGAAGA 


AAGTACATCG AGTIAGTATG 


1800 


GAAGGAGCTC 


GAAGAAAAGT 


TTGGATTTAA 


AGTCCTCTAC 


ATTGACACTG ATGGTCTCTA 


1860 


TGCAACTATC 


CCAGGAGGAG AAAGTGAGGA 


AATAAAGAAA 


AAGGCTCTAG AATTTGTAAA 


1920 
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ATACATAAAT TCAAAGCTCC 


CTGGACTGCT 


AGAGCTTGAA TATGAAGGGT 


TTTATAAGAG 


1980 


GGGAXTCTTC GTTACGAAGA 


AGAGGTATGC 


AGTAATAGAT GAAGAAGGAA 


AAGTCATTAC 


2040 


TCGTGGTTTA GAGATAGTTA 


GGAGAGATTG 


GAGTGAAATT GCAAAAGAAA 


CTCAAGCTAG 


2100 


AGTTTTGGAG ACAATACTAA 


AAGACGGAGA 


TGTTGAAGAA GCTGTGAGAA 


TAGTAAAAGA 


2160 


AGTAATACAA AAGCTTGCCA 


ATTATGAAAT 


TCCACCAGAG AAGCTCGCAA 


TATATGAGCA 


2220 


GATAACAAGA CCATTACATG 


AGTATAAGGC 


GATAGGTCCT CACGTAGCTG 


TTGCAAAGAA 


2280 


ACTAGCTGCT AAAGGAGTTA 


AAATAAAGCC 


AGGAATGGTA ATTGGATACA 


TAGTACTTAG 


2340 


AGGCGATGGT CCAATTAGCA 


ATAGGGCAAT 


TCTAGCTGAG GAATACGATC 


CCAAAAAGGA 


2400 


CAAGTATGAC GCAGAATATT 


ACATTGAGAA 


CCAGGTTCTT CCAGCGGTAC 


TTAGGATATT 


/ £ ft 

2460 


GGAGGGATTT GGATACAGAA 


AGGAAGACCT 


CAGATACCAA AAGACAAGAC 


AAGTCGGCCT 


2520 


AACTTCCTGG CTTAACATTA 


AAAAATCCTA 


GAAAAGCGAT AGATATCAAC 


TTTTATTCTT 


2580 


TCTAACCTTT TTCTATGAAA 


GAAGAACTGA 


GCAGGAATTA CCAGTTCTTC 


CGTTATTTTA 


Of /.ft 

2640 


TGGGTAATTA AAAACCCATG 


CTCTTGGGAG 


AATCTTCGAA TAAAATCCCT 


AACTTCAGGC 


2700 


TTTGCTAAGT GAATAGAATA 


AACAACATCA 


CTCACTTCAA ACGCCTTCGT 


TAGAAATGGT 


2760 


CTATCTGCAT GCTTCTCTGG 


CTCGGAANNG 


GAGGATTCAT AACAACAGTA 


TCAAGATTCT 


2820 


CAGAGAATTG AGAAACATCA 


GAAACTTTGA 


CTTCTACAAC ATTTCTAACT 


TTGCAACTCT 


ft o e ft 

2880 


TCAAGATTTT CTAAAAGAAT 


TTTAACGGCC 


TCCTCGTCAA TTTCGACGAC 


GTAGATCTTT 


n ft a rt 

2940 


TTTGCTCCAA GCAGAGCCGG 


TCCAATGGAT 


AACACGCCTG TTCCCGCACC 


CAAGTCCGCT 


3000 


ACAATTTTTT CCTTGTATCT 


CCTAATGTAT 


AAGCAAGCGA AAGGAGAGXA 


GATGCTACCT 


O ft f ft 

3060 


TTCCGGGAGT TTTGTATTGC 


TCTAGCCAAG 


GTTTGGGATT TTTGAATCCT 


TTAACTCTGG 


*\ « O ft 

3120 


AAAGTATAAT TTCAAGCTCC 


TTCTTCTTCA 


TGACAGATGA AAAATTGTTT 


TGTCTCTTTT 


3180 


TAACTTTTAC AGAAATAACT 


GTCTGAAATT 


ATGACAACTC TTGACATTTT 


TACTTCATTA 


3240 


CCAGGGTAAT GTTTTTAAGT 


ATGAAATTXT 


TCTTTCATAG AGGAGGNNNN 


NNGTCCTCTC 


3300 


CTCGATTTCC TTGGTTGTGC 


TCCATATGAT 


AAGCTTCCAA AGTGGGTGTT 


CAGACTTTTA 


3360 


GACACTCAAA TACCAGACGA 


CAATGGTGTG 


CTCACTCAAG CCCCATATGG 


GTTGAGAAAA 


3420 


GTAGAAGCGG CACTACTCAG 


ATGCTTCCCC 


AGGAATGAGG TTGTTGTAGC 


TCNTCCCNGA 


3480 
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What is Claimed is: 

1. Purified thermostable Pvrococcu s furiosus 
DNA polymerase I having an amino-terminal amino acid 
residue sequence represented by the formula shown in 

5 SEQ ID NO 1 from residue 1 to residue 775. 

2. The polymerase of claim 1 wherein said 
polymerase migrates on a non-denaturing polyacrylamide 
gel faster than phosphorylase B and Tag polymerase and 
more slowly than bovine serum albumin and has an 

10 estimated molecular weight of 90,000-93,000 daltons 

when compared with a Tag polymerase standard assigned 
a molecular weight of 94,000 daltons. 

3. The polymerase of claim 1 that is isolated 
from Pvrococcus furiosus . 

15 4. The polymerase of claim 1 that is isolated 

from a recombinant organism transformed with a vector 
that codes for the expression of Pyrococcus furiosus 
DNA polymerase. 

5. A plasmid containing a gene coding for a 

20 Pyrococcus furiosus DNA polymerase I having an amino- 
terminal amino acid residue sequence represented by 
the formula shown in SEQ ID NO 1 from residue 1 to 
residue 775. 

6. The plasmid of claim 5 wherein said 

25 polymerase has a molecular weight of about 90,000 to 
93,000 daltons as determined by SDS-PAGE under non- 
reducing conditions using Taq polymerase as a 
molecular weight marker having an assigned molecular 
weight of 94,000 daltons. 

30 7. The plasmid of claim 5 wherein said gene is 

operably linked to a promoter. 

8. A procaryotic cell transformed with the 
plasmid of claim 5. 

9. A bacteriophage containing a gene coding for 
35 a pyrococcus furiosus DNA polymerase I having an 
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amino- terminal amino acid residue sequence represented 
by the formula shown in SEQ ID NO 1 from residue 1 to 
residue 775. 

10. The bacteriophage of claim 9 wherein said 
5 polymerase has a molecular weight of about 90,000 to 
93,000 daltons as determined by SDS-PAGE under non- 
denaturing conditions using Taq polymerase as a 
molecular weight marker having an assigned molecular 
weight of 94,000 daltons. 
10 11. The bacteriophage of claim 9 wherein said 

gene is operatively linked to a promoter. 
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